1. Genetic Programming Based on Granular Computing for Classification with High-Dimensional Data
- Author
-
Mengjie Zhang, Wenbin Pei, Lin Shang, and Bing Xue
- Subjects
Clustering high-dimensional data ,Computer science ,business.industry ,05 social sciences ,Granular computing ,Training time ,050301 education ,Genetic programming ,Feature selection ,02 engineering and technology ,Machine learning ,computer.software_genre ,Statistical classification ,ComputingMethodologies_PATTERNRECOGNITION ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Artificial intelligence ,Baseline (configuration management) ,business ,0503 education ,computer ,Curse of dimensionality - Abstract
Classification tasks become more challenging when having the curse of dimensionality issue. Recently, there has been an increasing number of datasets with thousands of features. Some classification algorithms often need feature selection to avoid the curse of dimensionality. Genetic programming (GP) has shown success in classification tasks. GP does not require to do feature selection because of its built-in capability to automatically select informative features. However, GP-based methods are often computationally intensive to achieve a good classification accuracy. Based on perspectives from granular computing (GrC), this paper proposes a new approach to linking features hierarchically for GP-based classification. Experiments on seven high-dimensional datasets show the effectiveness of the proposed algorithm in terms of saving training time and enhancing the classification accuracy, compared to baseline methods.
- Published
- 2018