Par-BF: A parallel partitioned Bloom filter for dynamic data sets
The International Journal of High Performance Computing …, 2016•journals.sagepub.com
Compared with a hash table, a Bloom filter (BF) is more space efficient for supporting fast
matching through a controllable and acceptable false positive probability. The space size of
the basic BF is predetermined based on the expected number of elements to be stored.
However, we cannot predict the space scale of a BF for dynamic sets. It is still challenging for
the two existing solutions, scalable BF (SBF) and dynamic BF (DBF), to manipulate dynamic
data sets with low memory overhead but achieving high performance. This article presents a …
matching through a controllable and acceptable false positive probability. The space size of
the basic BF is predetermined based on the expected number of elements to be stored.
However, we cannot predict the space scale of a BF for dynamic sets. It is still challenging for
the two existing solutions, scalable BF (SBF) and dynamic BF (DBF), to manipulate dynamic
data sets with low memory overhead but achieving high performance. This article presents a …
Compared with a hash table, a Bloom filter (BF) is more space efficient for supporting fast matching through a controllable and acceptable false positive probability. The space size of the basic BF is predetermined based on the expected number of elements to be stored. However, we cannot predict the space scale of a BF for dynamic sets. It is still challenging for the two existing solutions, scalable BF (SBF) and dynamic BF (DBF), to manipulate dynamic data sets with low memory overhead but achieving high performance. This article presents a partitioned BF (Par-BF) for dynamic data sets. Compared with DBF and SBF, Par-BF is able to leverage a sweet spot between high performance and low overhead by a group of formulas to support fast concurrent matching. Specifically, the size and the range of the false positive probability in Par-BF can be deliberately derived. From our trace-driven experimental results, the input/output operations per second of Par-BF outperforms that of DBF and SBF by 10× to 14× and by 3× to 8×, respectively. Besides, through our proposed garbage collection policy, Par-BF consumes less than half of the memory usage of SBF.
Sage Journals
Showing the best result for this search. See all results