Content area

Abstract

Bloom filters are widely used in modern data systems to accelerate membership tests in databases, storage engines, and networking stacks. On modern CPUs, Bloom filter performance is typically limited by memory latency rather than computation: each query issues k seemingly random probes over a bit array that is often much larger than the last-level cache (LLC). Most prior work on making Bloom filters faster has focused on changing the data structure itself to align with the modern cache hierarchy. We take a complementary approach and ask how far one can go by changing only the query strategy.

First, we show that reordering the k probes for each query–either by fully sorting them (cache-oblivious) or by partitioning them around a cache-sized pivot (cache-aware)–transforms the aggregate access distribution from uniform to strongly skewed toward low indices, greatly improving cache reuse without modifying the Bloom filter representation. Second, we refine the hash generation process to behave more like a low-discrepancy sequence over the filter, reducing the fraction of queries whose probes all fall outside the LLC-sized region.

Across a range of filter sizes for negative-query workloads, our designs reduce LLC-load misses by up to 8.3× and improve end-to-end runtime by up to 1.45× relative to a conventional Bloom filter, while remaining competitive with specialized cache-aware structures such as split-block Bloom filters.

Details

1010268
Title
Cache-Efficient Bloom Filters Designs for 2035’s Memory Hierarchies
Number of pages
58
Publication year
2025
Degree date
2025
School code
0041
Source
MAI 87/6(E), Masters Abstracts International
ISBN
9798270247027
Committee member
Kaminsky, Michael
University/institution
Carnegie Mellon University
Department
Information Networking Institute
University location
United States -- Pennsylvania
Degree
M.S.
Source type
Dissertation or Thesis
Language
English
Document type
Dissertation/Thesis
Dissertation/thesis number
32401470
ProQuest document ID
3286141490
Document URL
https://www.proquest.com/dissertations-theses/strong-cache-efficient-bloom-filters-designs-2035/docview/3286141490/se-2?accountid=208611
Copyright
Database copyright ProQuest LLC; ProQuest does not claim copyright in the individual underlying works.
Database
ProQuest One Academic