Back to Search Start Over

Mining frequent patterns with the pattern tree

Authors :
Hao Huang
Richard Relue
Xindong Wu
Source :
New Generation Computing. 23:315-337
Publication Year :
2005
Publisher :
Springer Science and Business Media LLC, 2005.

Abstract

Mining frequent patterns with a frequent pattern tree (FP-tree in short) avoids costly candidate generation and repeatedly occurrence frequency checking against the support threshold. It therefore achieves much better performance and efficiency than Apriori-like algorithms. However, the database still needs to be scanned twice to get the FP-tree. This can be very time-consuming when new data is added to an existing database because two scans may be needed for not only the new data but also the existing data. In this research we propose a new data structure, the pattern tree (P-tree in short), and a new technique, which can get the P-tree through only one scan of the database and can obtain the corresponding FP-tree with a specified support threshold. Updating a P-tree with new data needs one scan of the new data only, and the existing data does not need to be re-scanned. Our experiments show that the P-tree method outperforms the FP-tree method by a factor up to an order of magnitude in large datasets.

Details

ISSN :
18827055 and 02883635
Volume :
23
Database :
OpenAIRE
Journal :
New Generation Computing
Accession number :
edsair.doi...........c30d3b576120274aa67ae4835847b643
Full Text :
https://doi.org/10.1007/bf03037636