Back to Search
Start Over
Class Dependent Multiple Feature Construction Using Genetic Programming for High-Dimensional Data
- Source :
- AI 2017: Advances in Artificial Intelligence ISBN: 9783319630038, Australasian Conference on Artificial Intelligence
- Publication Year :
- 2017
- Publisher :
- Springer International Publishing, 2017.
-
Abstract
- Genetic Programming (GP) has shown promise in feature construction where high-level features are formed by combining original features using predefined functions or operators. Multiple feature construction methods have been proposed for high-dimensional data with thousands of features. Results of these methods show that several constructed features can maintain or even improve the discriminating ability of the original feature set. However, some particular features may have better ability than other features to distinguish instances of one class from other classes. Therefore, it may be more difficult to construct a better discriminating feature when combing features that are relevant to different classes. In this study, we propose a new GP-based feature construction method called CDFC that constructs multiple features, each of which focuses on distinguishing one class from other classes. We propose a new representation for class-dependent feature construction and a new fitness function to better evaluate the constructed feature set. Results on eight datasets with varying difficulties showed that the features constructed by CDFC can improve the discriminating ability of thousands of original features in most cases. Results also showed that CFDC is more effective and efficient than the hybrid MGPFC method which was shown to have better performance than standard GP to feature construction.
- Subjects :
- 0301 basic medicine
Clustering high-dimensional data
Class (computer programming)
Fitness function
Computer science
business.industry
Genetic programming
Feature selection
Pattern recognition
02 engineering and technology
Machine learning
computer.software_genre
03 medical and health sciences
030104 developmental biology
Feature (computer vision)
0202 electrical engineering, electronic engineering, information engineering
020201 artificial intelligence & image processing
Genetic representation
Artificial intelligence
Representation (mathematics)
business
computer
Subjects
Details
- ISBN :
- 978-3-319-63003-8
- ISBNs :
- 9783319630038
- Database :
- OpenAIRE
- Journal :
- AI 2017: Advances in Artificial Intelligence ISBN: 9783319630038, Australasian Conference on Artificial Intelligence
- Accession number :
- edsair.doi...........bc01388e1d820a0dae9826a21d9c7499