Back to Search
Start Over
Mining frequent patterns with the pattern tree
- Source :
- New Generation Computing. 23:315-337
- Publication Year :
- 2005
- Publisher :
- Springer Science and Business Media LLC, 2005.
-
Abstract
- Mining frequent patterns with a frequent pattern tree (FP-tree in short) avoids costly candidate generation and repeatedly occurrence frequency checking against the support threshold. It therefore achieves much better performance and efficiency than Apriori-like algorithms. However, the database still needs to be scanned twice to get the FP-tree. This can be very time-consuming when new data is added to an existing database because two scans may be needed for not only the new data but also the existing data. In this research we propose a new data structure, the pattern tree (P-tree in short), and a new technique, which can get the P-tree through only one scan of the database and can obtain the corresponding FP-tree with a specified support threshold. Updating a P-tree with new data needs one scan of the new data only, and the existing data does not need to be re-scanned. Our experiments show that the P-tree method outperforms the FP-tree method by a factor up to an order of magnitude in large datasets.
- Subjects :
- Association rule learning
Computer Networks and Communications
business.industry
Computer science
Data needs
Pattern recognition
Data structure
computer.software_genre
Theoretical Computer Science
Tree (data structure)
Hardware and Architecture
Factor (programming language)
Artificial intelligence
Data mining
business
computer
Software
computer.programming_language
Subjects
Details
- ISSN :
- 18827055 and 02883635
- Volume :
- 23
- Database :
- OpenAIRE
- Journal :
- New Generation Computing
- Accession number :
- edsair.doi...........c30d3b576120274aa67ae4835847b643
- Full Text :
- https://doi.org/10.1007/bf03037636