Back to Search Start Over

A Multi-partitioning Approach to Building Fast and Accurate Counting Bloom Filters

Authors :
Institute of Computing Technology [Biejing] (ICT) ; Chinese Academy of Science (CAS)
Computer and Communication School ; Hunan University
Laboratoire d'Informatique, Systèmes, Traitement de l'Information et de la Connaissance (LISTIC) ; Université de Savoie
Computer Science and Engineering Dept. (MSU CS) ; Michigan State University
Huang, Kun
Zhang, Dafang
Xie, Gaogang
Kavé, Salamatian
Liu, Alex
Li, Wei
Institute of Computing Technology [Biejing] (ICT) ; Chinese Academy of Science (CAS)
Computer and Communication School ; Hunan University
Laboratoire d'Informatique, Systèmes, Traitement de l'Information et de la Connaissance (LISTIC) ; Université de Savoie
Computer Science and Engineering Dept. (MSU CS) ; Michigan State University
Huang, Kun
Zhang, Dafang
Xie, Gaogang
Kavé, Salamatian
Liu, Alex
Li, Wei
Source :
2013 IEEE 27th International Symposium on Parallel & Distributed Processing (IPDPS),; 2013 IEEE 27th International Symposium on Parallel & Distributed Processing (IPDPS),, May 2013, Boston, United States. IEEE, pp.1159-1170

Abstract

International audience<br />Bloom filters are space-efficient data structures for fast set membership queries. Counting Bloom Filters (CBFs) extend Bloom filters by allowing insertions and deletions to support dynamic sets. The performance of CBFs is critical for various applications and systems. This paper presents a novel approach to building a fast and accurate data structure called Multiple-Partitioned Counting Bloom Filter (MPCBF) that addresses large-scale data processing challenges. MPCBF is based on two ideas: reducing the number of memory accesses from k (for k hash functions) in the standard CBF to only one memory access in the basic MPCBF-1 case, and a hierarchical structure to improve the false positive rate. We also generalize MPCBF-1 to MPCBF-g to accommodate up to g memory accesses. Our simulation and implementation in MapReduce show that MPCBF outperforms the standard CBF in terms of speed and accuracy. Compared to CBF, at the same memory consumption, MPCBF significantly reduces the false positive rate by an order of magnitude, with a reduction of processing overhead by up to 85.9%.

Details

Database :
OAIster
Journal :
2013 IEEE 27th International Symposium on Parallel & Distributed Processing (IPDPS),; 2013 IEEE 27th International Symposium on Parallel & Distributed Processing (IPDPS),, May 2013, Boston, United States. IEEE, pp.1159-1170
Notes :
Boston, United States, 2013 IEEE 27th International Symposium on Parallel & Distributed Processing (IPDPS), English
Publication Type :
Electronic Resource
Accession number :
edsoai.ocn893166617
Document Type :
Electronic Resource