Back to Search Start Over

Efficient Algorithms for Mining Erasable Closed Patterns From Product Datasets

Authors :
Bay Vo
Giang Nguyen
Tzung-Pei Hong
Tuong Le
Source :
IEEE Access, Vol 5, Pp 3111-3120 (2017)
Publication Year :
2017
Publisher :
Institute of Electrical and Electronics Engineers (IEEE), 2017.

Abstract

Finding knowledge from large data sets to use in intelligent systems becomes more and more important in the Internet era. Pattern mining, classification, text mining, and opinion mining are the topical issues. Among them, pattern mining is an important issue. The problem of mining erasable patterns (EPs) has been proposed as a variant of frequent pattern mining for optimizing the production plans of factories. Several algorithms have been proposed for effectively mining EPs. However, for large threshold values, many EPs are obtained, leading to large memory usage. Therefore, it is necessary to mine a condensed representation of EPs. This paper first defines erasable closed patterns (ECPs), which can represent the set of EPs without information loss. Then, a theorem for fast determining ECPs based on dPidset structure is proposed and proven. Next, two efficient algorithms [erasable closed pattern mining (ECPat) and dNC_Set based algorithm for erasable closed pattern mining (dNC-ECPM)] for mining ECPs based on this theorem are proposed. Experimental results show that ECPat is the best method for sparse data sets, while dNC-ECPM algorithm outperforms ECPat algorithm and a modified mining erasable itemsets algorithm in terms of the mining time and memory usage for all remaining data sets.

Details

ISSN :
21693536
Volume :
5
Database :
OpenAIRE
Journal :
IEEE Access
Accession number :
edsair.doi.dedup.....c0f0b65bf899224058e50ba7ad5c6049
Full Text :
https://doi.org/10.1109/access.2017.2676803