Feature Selection Based on Pairwise Classification Performance
- Source :
- Computer Aided Systems Theory-EUROCAST 2009 ISBN: 9783642047718, EUROCAST
- Publication Year :
- 2009
- Publisher :
- Springer Berlin Heidelberg, 2009.
Abstract
- The process of feature selection is an important first step in building machine learning models. Feature selection algorithms can be grouped into wrappers and filters; the former use machine learning models to evaluate feature sets, while the latter use other criteria to evaluate features individually. We present a new approach to feature selection that combines the advantages of both wrapper and filter approaches by using logistic regression and the area under the ROC curve (AUC) to evaluate pairs of features. After choosing as the starting feature the one with the highest individual discriminatory power, we incrementally rank features by choosing as the next feature the one that achieves the highest AUC in combination with an already chosen feature. To evaluate our approach, we compared it to standard filter and wrapper algorithms. Using two data sets from the biomedical domain, we are able to demonstrate that the performance of our approach exceeds that of filter methods, while being comparable to wrapper methods at smaller computational cost.
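The ranking procedure outlined in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`auc`, `fit_logistic`, `subset_auc`, `pairwise_rank`), the plain gradient-descent fit, the use of training-set AUC instead of a held-out estimate, and the synthetic data are all assumptions made for illustration. In particular, "in combination with an already chosen feature" is read here as the best AUC over all pairs formed by a candidate and any previously ranked feature.

```python
import math
import random

def auc(scores, labels):
    """AUC as the fraction of (positive, negative) pairs ranked correctly; ties count 0.5."""
    pos = [s for s, t in zip(scores, labels) if t == 1]
    neg = [s for s, t in zip(scores, labels) if t == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def fit_logistic(rows, labels, lr=0.1, epochs=300):
    """Logistic regression by batch gradient descent (assumed fitting method)."""
    d, n = len(rows[0]), len(rows)
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        gw, gb = [0.0] * d, 0.0
        for x, t in zip(rows, labels):
            z = b + sum(wi * xi for wi, xi in zip(w, x))
            z = max(-30.0, min(30.0, z))  # clamp to keep exp() well behaved
            err = 1.0 / (1.0 + math.exp(-z)) - t
            gb += err
            for j in range(d):
                gw[j] += err * x[j]
        w = [wi - lr * g / n for wi, g in zip(w, gw)]
        b -= lr * gb / n
    return w, b

def subset_auc(X, y, cols):
    """Fit a model on the given feature columns and return its training-set AUC."""
    rows = [[x[c] for c in cols] for x in X]
    w, b = fit_logistic(rows, y)
    scores = [b + sum(wi * xi for wi, xi in zip(w, r)) for r in rows]
    return auc(scores, y)

def pairwise_rank(X, y):
    """Rank features: best single feature first, then greedily by best pairwise AUC."""
    remaining = list(range(len(X[0])))
    first = max(remaining, key=lambda j: subset_auc(X, y, [j]))
    ranked = [first]
    remaining.remove(first)
    cache = {}  # pair AUCs are symmetric, so each pair is fitted once

    def pair_score(j, k):
        key = (min(j, k), max(j, k))
        if key not in cache:
            cache[key] = subset_auc(X, y, list(key))
        return cache[key]

    while remaining:
        # Score each candidate by its best pairing with an already ranked feature.
        nxt = max(remaining, key=lambda j: max(pair_score(j, k) for k in ranked))
        ranked.append(nxt)
        remaining.remove(nxt)
    return ranked

# Synthetic data (hypothetical): feature 0 is strongly informative,
# feature 1 is pure noise, feature 2 is weakly informative.
rng = random.Random(0)
y = [i % 2 for i in range(60)]
X = [[t + rng.gauss(0, 0.1), rng.gauss(0, 1), t + rng.gauss(0, 1)] for t in y]
ranking = pairwise_rank(X, y)
print("feature ranking (most useful first):", ranking)
```

Because only single features and pairs are ever fitted, the number of model fits grows at most quadratically with the number of features, which is consistent with the abstract's claim of lower computational cost than a wrapper that evaluates larger feature subsets.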
- Subjects :
- Computer science
Dimensionality reduction
Feature vector
Feature extraction
Pattern recognition
Feature selection
Filter (signal processing)
k-nearest neighbors algorithm
Feature (computer vision)
Artificial intelligence
Data mining
Feature learning
Details
- ISBN :
- 978-3-642-04771-8
- Database :
- OpenAIRE
- Accession number :
- edsair.doi...........984a3e17fb82ebaf6dd7d78661bea837
- Full Text :
- https://doi.org/10.1007/978-3-642-04772-5_99