Back to Search
Start Over
Efficient Algorithms for Mining Erasable Closed Patterns From Product Datasets
- Source :
- IEEE Access, Vol 5, Pp 3111-3120 (2017)
- Publication Year :
- 2017
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2017.
-
Abstract
- Finding knowledge from large data sets to use in intelligent systems becomes more and more important in the Internet era. Pattern mining, classification, text mining, and opinion mining are the topical issues. Among them, pattern mining is an important issue. The problem of mining erasable patterns (EPs) has been proposed as a variant of frequent pattern mining for optimizing the production plans of factories. Several algorithms have been proposed for effectively mining EPs. However, for large threshold values, many EPs are obtained, leading to large memory usage. Therefore, it is necessary to mine a condensed representation of EPs. This paper first defines erasable closed patterns (ECPs), which can represent the set of EPs without information loss. Then, a theorem for fast determining ECPs based on dPidset structure is proposed and proven. Next, two efficient algorithms [erasable closed pattern mining (ECPat) and dNC_Set based algorithm for erasable closed pattern mining (dNC-ECPM)] for mining ECPs based on this theorem are proposed. Experimental results show that ECPat is the best method for sparse data sets, while dNC-ECPM algorithm outperforms ECPat algorithm and a modified mining erasable itemsets algorithm in terms of the mining time and memory usage for all remaining data sets.
- Subjects :
- erasable closed pattern
General Computer Science
Computer science
02 engineering and technology
computer.software_genre
Set (abstract data type)
Text mining
020204 information systems
0202 electrical engineering, electronic engineering, information engineering
General Materials Science
Representation (mathematics)
Data mining
Structure (mathematical logic)
erasable pattern
Data stream mining
business.industry
Sentiment analysis
General Engineering
Intelligent decision support system
Data set
020201 artificial intelligence & image processing
Algorithm design
lcsh:Electrical engineering. Electronics. Nuclear engineering
business
lcsh:TK1-9971
computer
pattern mining
Subjects
Details
- ISSN :
- 21693536
- Volume :
- 5
- Database :
- OpenAIRE
- Journal :
- IEEE Access
- Accession number :
- edsair.doi.dedup.....c0f0b65bf899224058e50ba7ad5c6049
- Full Text :
- https://doi.org/10.1109/access.2017.2676803