101. A Supervised Filter Feature Selection method for mixed data based on Spectral Feature Selection and Information-theory redundancy analysis
- Author
-
José Fco. Martínez-Trinidad, J. Ariel Carrasco-Ochoa, and Saúl Solorio-Fernández
- Subjects
Computer science ,business.industry ,Pattern recognition ,Feature selection ,02 engineering and technology ,Information theory ,01 natural sciences ,ComputingMethodologies_PATTERNRECOGNITION ,Redundancy (information theory) ,Artificial Intelligence ,0103 physical sciences ,Signal Processing ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Spectral analysis ,Computer Vision and Pattern Recognition ,Artificial intelligence ,010306 general physics ,business ,Software - Abstract
Spectral analysis and Information-theory are two powerful and successful frameworks for feature selection in supervised classification problems. However, most of the methods developed under these frameworks have been introduced for handling exclusively numerical or non- numerical data. In this paper, we propose a supervised filter feature selection method that combines Spectral Feature Selection and Information-theory based redundancy analysis for selecting relevant and non-redundant features in supervised mixed datasets; i.e., datasets where the objects are described simultaneously by both, numerical and non-numerical features. To demonstrate the effectiveness of our proposed supervised filter feature selection method, we conducted several experiments on 40 public real-world datasets. Additionally, we compare our method against relevant state-of-the-art supervised filter methods for numerical, non-numerical, and mixed data. From this comparison, our method, in general, obtains better results than the results obtained by the other evaluated filter feature selection methods.
- Published
- 2020