Back to Search Start Over

A Reinduction-Based Approach for Efficient High Utility Itemset Mining from Incremental Datasets.

Authors :
Sra, Pushp
Chand, Satish
Source :
Data Science & Engineering; Mar2024, Vol. 9 Issue 1, p73-87, 15p
Publication Year :
2024

Abstract

High utility itemset mining is a crucial research area that focuses on identifying combinations of itemsets from databases that possess a utility value higher than a user-specified threshold. However, most existing algorithms assume that the databases are static, which is not realistic for real-life datasets that are continuously growing with new data. Furthermore, existing algorithms only rely on the utility value to identify relevant itemsets, leading to even the earliest occurring combinations being produced as output. Although some mining algorithms adopt a support-based approach to account for itemset frequency, they do not consider the temporal nature of itemsets. To address these challenges, this paper proposes the Scented Utility Miner (SUM) algorithm that uses a reinduction strategy to track the recency of itemset occurrence and mine itemsets from incremental databases. The paper provides a novel approach for mining high utility itemsets from dynamic databases and presents several experiments that demonstrate the effectiveness of the proposed approach. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
23641185
Volume :
9
Issue :
1
Database :
Complementary Index
Journal :
Data Science & Engineering
Publication Type :
Academic Journal
Accession number :
176340362
Full Text :
https://doi.org/10.1007/s41019-023-00229-4