Back to Search Start Over

Classification and variable selection using the mining of positive and negative association rules.

Authors :
Do Van, Thanh
Nguyen, Giap Cu
Thi, Ha Dinh
Ngoc, Lam Pham
Source :
Information Sciences. Jun2023, Vol. 631, p218-240. 23p.
Publication Year :
2023

Abstract

• Use rules of forms A ℸ B ⇒ z o r ℸ z for feature selection and classification. • Algorithm mining the rules is built based on equivalence classes. • It exploits the downward closure property of negative itemsets and is complete. • Classification using the rules outperforms ten popular classification algorithms. • Feature selection using the rules outperforms two popular variable selection methods. Association rules (ARs) have been applied to classification and variable selection. However, currently, only positive ARs are used for variable selection, while only special forms of positive and negative association rules (PNARs) are used for classification. The purpose of this work was to investigate variable selection and classification methods by mining another, more general form of PNARs, one that is more suitable for binary classification and variable selection problems. The algorithm for mining such PNARs exploits the downward closure property of negative itemsets. It is built based solely on items in a transactional database and on equivalence classes under the support–confidence framework. The algorithm combines the process of mining frequent itemsets and rule generation and is both sound and complete. Experimental results on 10 binary datasets of the variable selection and classification methods using the PNARs mined by the proposed algorithm show that these methods are superior to variable selection methods that use the mutual information measure and the chi-squared test and 10 popular classification algorithms, respectively. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
00200255
Volume :
631
Database :
Academic Search Index
Journal :
Information Sciences
Publication Type :
Periodical
Accession number :
162636237
Full Text :
https://doi.org/10.1016/j.ins.2023.02.068