Granular multi-label feature selection based on mutual information

Authors :
Witold Pedrycz
Duoqian Miao
Feng Li
Source :
Pattern Recognition. 67:410-423
Publication Year :
2017
Publisher :
Elsevier BV, 2017.

Abstract

We granulate the label space into information granules to exploit label dependency. We present a multi-label maximal correlation minimal redundancy criterion. The proposed method can select compact and specific feature subsets. The proposed method can significantly improve algorithm performance.

Like traditional machine learning, multi-label learning faces the curse of dimensionality. Several feature selection algorithms have been proposed for multi-label learning; they either convert the multi-label feature selection problem into numerous single-label feature selection problems, or select features directly from the multi-label data set. However, the former either ignore label dependency or produce so many new labels that learning becomes significantly more difficult. The latter take global label dependency into consideration but usually select some redundant or irrelevant features, because not all labels actually depend on each other; such features may confuse the algorithm and degrade its classification performance. To select a more relevant and compact feature subset while also exploring label dependency, a granular feature selection method for multi-label learning is proposed, with a maximal correlation minimal redundancy criterion based on mutual information. The criterion ensures that the selected feature subset contains the most class-discriminative information while exhibiting the least intra-redundancy. Granulation helps explore the label dependency. We study the relation between label granularity and performance on four data sets, and compare the proposed method with three other multi-label feature selection methods. The experimental results demonstrate that the proposed method selects compact and specific feature subsets, improves classification performance, and outperforms the three other methods on widely used multi-label learning evaluation criteria.
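The general idea of a mutual-information-based "maximal correlation, minimal redundancy" selection over label granules can be illustrated with a minimal sketch. The abstract does not give the paper's exact criterion or its granulation procedure, so everything below is an assumption for illustration only: a granule is modeled as the joint value of a hand-picked group of label columns, and features are scored greedily by summed relevance to the granules minus mean redundancy with already selected features (an mRMR-style difference form). The names `mutual_information`, `make_granule`, and `greedy_select` are hypothetical.

```python
from collections import Counter
from math import log


def mutual_information(xs, ys):
    """Empirical mutual information (in nats) of two discrete sequences."""
    n = len(xs)
    px, py = Counter(xs), Counter(ys)
    pxy = Counter(zip(xs, ys))
    # Sum p(x,y) * log(p(x,y) / (p(x) p(y))) over observed pairs.
    return sum((c / n) * log(c * n / (px[x] * py[y]))
               for (x, y), c in pxy.items())


def make_granule(label_columns):
    """Assumption: a granule is the joint value of a group of label columns."""
    return list(zip(*label_columns))


def greedy_select(features, granules, k):
    """Greedily pick up to k feature indices.

    Score = relevance (sum of MI with each label granule)
            minus mean MI with the features already selected.
    This is an illustrative criterion, not the paper's exact formula.
    """
    relevance = [sum(mutual_information(f, g) for g in granules)
                 for f in features]
    selected, remaining = [], list(range(len(features)))

    def score(j):
        if not selected:
            return relevance[j]
        redundancy = sum(mutual_information(features[j], features[s])
                         for s in selected) / len(selected)
        return relevance[j] - redundancy

    while remaining and len(selected) < k:
        best = max(remaining, key=score)  # ties go to the lowest index
        selected.append(best)
        remaining.remove(best)
    return selected


# Toy data: f1 duplicates f0, so it is redundant once f0 is chosen.
f0 = [0, 0, 1, 1, 0, 0, 1, 1]
f1 = list(f0)
f2 = [0, 1, 0, 1, 0, 1, 0, 1]
granules = [make_granule([f0]), make_granule([f2])]
chosen = greedy_select([f0, f1, f2], granules, 2)
```

In this toy setting the duplicate feature is skipped in favor of the one that carries new information about the second granule, which is the behavior a redundancy penalty is meant to produce.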

Details

ISSN :
0031-3203
Volume :
67
Database :
OpenAIRE
Journal :
Pattern Recognition
Accession number :
edsair.doi...........4a43939abe968fcb8a54a82f37f93784
Full Text :
https://doi.org/10.1016/j.patcog.2017.02.025