1. The design of dynamic ensemble selection strategy for the error-correcting output codes family
- Author
-
Qingqiang Wu, Kun-Hong Liu, Jia-Yu Zou, and Mengxin Sun
- Subjects
Information Systems and Management ,Source code ,Computer science ,media_common.quotation_subject ,Feature selection ,02 engineering and technology ,computer.software_genre ,Theoretical Computer Science ,Set (abstract data type) ,Multiclass classification ,Artificial Intelligence ,0202 electrical engineering, electronic engineering, information engineering ,Feature (machine learning) ,computer.programming_language ,media_common ,05 social sciences ,050301 education ,Python (programming language) ,Computer Science Applications ,Control and Systems Engineering ,020201 artificial intelligence & image processing ,Data mining ,0503 education ,computer ,Software ,Decoding methods ,Coding (social sciences) - Abstract
Error-Correcting Output Codes (ECOC) is widely deployed to tackle the multiclass classification problem by reducing the original multi-class problem to several binary sub-problems. This study attempts to design a dynamic ensemble selection strategy to promote the performance of ECOC algorithms. Concretely, each column in a coding matrix is matched with a set of feature subsets generated by various feature selection methods. In the decoding process, a novel criterion based on the data complexity theory is proposed to pick up an optimal feature subset from the candidate subsets, so as to better distinguish unknown samples. As this strategy can be embedded in all types of ECOC algorithms, seven classical ECOC algorithms are deployed to verify the effectiveness of our strategy. Experiments are carried out on a set of UCI data sets, and the results confirm that despite different working principle, the proposed strategy can further improve the performance of various ECOC algorithms in most cases. Our python source code is available at: https://github.com/MLDMXM2017/ECOC_DES .
- Published
- 2021
- Full Text
- View/download PDF