Back to Search
Start Over
Recursive gene selection based on maximum margin criterion: a comparison with SVM-RFE.
- Source :
-
BMC bioinformatics [BMC Bioinformatics] 2006 Dec 25; Vol. 7, pp. 543. Date of Electronic Publication: 2006 Dec 25. - Publication Year :
- 2006
-
Abstract
- Background: In class prediction problems using microarray data, gene selection is essential to improve the prediction accuracy and to identify potential marker genes for a disease. Among numerous existing methods for gene selection, support vector machine-based recursive feature elimination (SVM-RFE) has become one of the leading methods and is being widely used. The SVM-based approach performs gene selection using the weight vector of the hyperplane constructed by the samples on the margin. However, the performance can be easily affected by noise and outliers, when it is applied to noisy, small sample size microarray data.<br />Results: In this paper, we propose a recursive gene selection method using the discriminant vector of the maximum margin criterion (MMC), which is a variant of classical linear discriminant analysis (LDA). To overcome the computational drawback of classical LDA and the problem of high dimensionality, we present efficient and stable algorithms for MMC-based RFE (MMC-RFE). The MMC-RFE algorithms naturally extend to multi-class cases. The performance of MMC-RFE was extensively compared with that of SVM-RFE using nine cancer microarray datasets, including four multi-class datasets.<br />Conclusion: Our extensive comparison has demonstrated that for binary-class datasets MMC-RFE tends to show intermediate performance between hard-margin SVM-RFE and SVM-RFE with a properly chosen soft-margin parameter. Notably, MMC-RFE achieves significantly better performance with a smaller number of genes than SVM-RFE for multi-class datasets. The results suggest that MMC-RFE is less sensitive to noise and outliers due to the use of average margin, and thus may be useful for biomarker discovery from noisy data.
- Subjects :
- Algorithms
Diagnosis, Computer-Assisted methods
Humans
Pattern Recognition, Automated methods
Artificial Intelligence
Biomarkers, Tumor analysis
Gene Expression Profiling methods
Neoplasm Proteins analysis
Neoplasms diagnosis
Neoplasms metabolism
Oligonucleotide Array Sequence Analysis methods
Subjects
Details
- Language :
- English
- ISSN :
- 1471-2105
- Volume :
- 7
- Database :
- MEDLINE
- Journal :
- BMC bioinformatics
- Publication Type :
- Academic Journal
- Accession number :
- 17187691
- Full Text :
- https://doi.org/10.1186/1471-2105-7-543