Statistically Significant Discriminative Patterns Searching
- Source :
- DaWaK 2019 - 21st International Conference on Big Data Analytics and Knowledge Discovery, Aug 2019, Linz, Austria. pp. 105-115, ⟨10.1007/978-3-030-27520-4_8⟩. Big Data Analytics and Knowledge Discovery, ISBN: 9783030275198
- Publication Year :
- 2019
- Publisher :
- HAL CCSD, 2019.
Abstract
- Discriminative pattern mining is an essential data mining task. It aims to discover patterns that occur more frequently in one class than in the other classes of a class-labeled dataset. Such patterns are valuable in various domains, such as bioinformatics and data classification. In this paper, we propose a novel algorithm, named SSDPS, to discover patterns in two-class datasets. The SSDPS algorithm owes its efficiency to an original pattern enumeration strategy, which makes it possible to exploit some degree of anti-monotonicity in the measures of discriminance and statistical significance. Experimental results demonstrate that SSDPS performs better than the algorithms it is compared against and generates far fewer patterns than they do. An experiment on real data also shows that SSDPS efficiently detects combinations of multiple SNPs in genetic data.
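
To make the notion of a discriminative, statistically significant pattern concrete, here is a minimal Python sketch for a two-class dataset. It is not the SSDPS algorithm and does not reproduce its enumeration strategy. The abstract does not name the measures used, so the support difference and the one-sided Fisher exact test below are assumptions chosen for illustration, as are the function names and the toy data.

```python
from math import comb

def class_supports(pattern, transactions, labels):
    """Count the transactions that contain `pattern` in class 1 and in class 0."""
    pattern = set(pattern)
    pos = sum(1 for t, y in zip(transactions, labels) if y == 1 and pattern <= set(t))
    neg = sum(1 for t, y in zip(transactions, labels) if y == 0 and pattern <= set(t))
    return pos, neg

def fisher_one_sided(pos, neg, n_pos, n_neg):
    """One-sided Fisher exact p-value (assumed significance measure).

    Probability of seeing at least `pos` class-1 transactions among the
    pos + neg transactions covered by the pattern, if coverage were
    independent of the class label (hypergeometric tail).
    """
    covered = pos + neg
    total = comb(n_pos + n_neg, covered)
    upper = min(covered, n_pos)
    tail = sum(comb(n_pos, x) * comb(n_neg, covered - x) for x in range(pos, upper + 1))
    return tail / total

# Toy two-class dataset (hypothetical): four transactions per class.
transactions = [
    {"a", "b", "c"}, {"a", "b"}, {"a", "b", "d"}, {"b", "c"},  # class 1
    {"a", "c"}, {"b", "d"}, {"c", "d"}, {"a", "b"},            # class 0
]
labels = [1, 1, 1, 1, 0, 0, 0, 0]

n_pos = labels.count(1)
n_neg = labels.count(0)
pos, neg = class_supports({"a", "b"}, transactions, labels)

# Assumed discriminative measure: difference of per-class relative supports.
support_diff = pos / n_pos - neg / n_neg
p_value = fisher_one_sided(pos, neg, n_pos, n_neg)

print(pos, neg, support_diff, round(p_value, 4))
```

On this toy data the pattern {a, b} covers three of the four class-1 transactions and one of the four class-0 transactions, so it is more frequent in class 1. With only eight transactions, however, the test does not reach conventional significance levels, which shows why a discriminance measure and a significance measure are checked separately.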
- Subjects :
- FOS: Computer and information sciences
Computer Science - Machine Learning
Anti-Monotonicity
[INFO.INFO-DB]Computer Science [cs]/Databases [cs.DB]
business.industry
Computer science
Pattern recognition
Machine Learning (stat.ML)
02 engineering and technology
Discriminative patterns
Machine Learning (cs.LG)
[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]
ComputingMethodologies_PATTERNRECOGNITION
Discriminative model
Statistics - Machine Learning
020204 information systems
0202 electrical engineering, electronic engineering, information engineering
Enumeration
020201 artificial intelligence & image processing
Statistical Significance
Discriminative Measures
Artificial intelligence
[INFO.INFO-DC]Computer Science [cs]/Distributed, Parallel, and Cluster Computing [cs.DC]
business
Details
- Language :
- English
- ISBN :
- 978-3-030-27519-8
- Database :
- OpenAIRE
- Journal :
- DaWaK 2019 - 21st International Conference on Big Data Analytics and Knowledge Discovery, Aug 2019, Linz, Austria. pp. 105-115, ⟨10.1007/978-3-030-27520-4_8⟩. Big Data Analytics and Knowledge Discovery, ISBN: 9783030275198
- Accession number :
- edsair.doi.dedup.....7e3e7e05026c94b70c06874647c399ce