Grokking System Design Fundamentals
Ask Author
Back to course home

0% completed

Introduction to Bloom Filters

Background

If we have a large set of structured data (identified by record IDs) stored in a set of data files, what is the most efficient way to know which file might contain our required data? We don't want to read each file, as that would be slow, and we have to read a lot of data from the disk. One solution can be to build an index on each data file and store it in a separate index file. This index can map each record ID to its offset in the data file. Each index file will be sorted on the record ID

.....

.....

.....

Like the course? Get enrolled and start learning!