Cancer Classification Utilizing Voting Classifier with Ensemble Feature Selection Method and Transcriptomic Data.

Authors :
Khatun R
Akter M
Islam MM
Uddin MA
Talukder MA
Kamruzzaman J
Azad A
Paul BK
Almoyad MAA
Aryal S
Moni MA
Source :
Genes [Genes (Basel)] 2023 Sep 14; Vol. 14 (9). Date of Electronic Publication: 2023 Sep 14.
Publication Year :
2023

Abstract

Biomarker-based cancer identification and classification tools are widely used in bioinformatics and machine learning. However, the high dimensionality of microarray gene expression data makes it difficult to identify the genes that matter for cancer diagnosis. Feature selection algorithms address this by reducing the data to a small set of informative features. This article proposes an ensemble rank-based feature selection method (EFSM) and an ensemble weighted average voting classifier (VT) to overcome this challenge. The EFSM uses a ranking method to aggregate the features chosen by individual selection methods and so efficiently discover the most relevant and useful features. The VT combines support vector machine, k-nearest neighbor, and decision tree algorithms into an ensemble model. The proposed method was tested on three benchmark datasets and compared to existing built-in ensemble models. The results show that our model achieved higher accuracy: 100% for leukaemia, 94.74% for colon cancer, and 94.34% for the 11-tumor dataset. The study concludes by identifying a subset of the most important cancer-causing genes and demonstrating their significance compared to the original data. The proposed approach surpasses existing strategies in accuracy and stability and detects vital genes with higher precision, with implications for the development of ML-based gene analysis.
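
The two ideas named in the abstract, rank aggregation across feature selectors and weighted average voting across classifiers, can be illustrated with a minimal sketch. The abstract does not state the aggregation rule or the classifier weights, so the mean-rank rule, the weights, and the function names `aggregate_ranks` and `weighted_vote` below are assumptions made for illustration and are not the authors' implementation. The sketch also assumes every selector ranks the same set of features.

```python
from collections import defaultdict


def aggregate_ranks(rankings, top_k):
    """Combine several feature rankings by mean rank position (lower is better).

    Assumption: each ranking lists the same features, best first.
    """
    totals = defaultdict(float)
    for ranking in rankings:
        for position, feature in enumerate(ranking):
            totals[feature] += position
    # Mean position across selectors; ties are broken by feature name.
    mean_rank = {f: s / len(rankings) for f, s in totals.items()}
    return sorted(mean_rank, key=lambda f: (mean_rank[f], f))[:top_k]


def weighted_vote(probabilities, weights):
    """Weighted average of per-class probabilities from several classifiers.

    Returns the index of the winning class and the averaged probabilities.
    """
    total = sum(weights)
    n_classes = len(probabilities[0])
    averaged = [
        sum(w * p[c] for p, w in zip(probabilities, weights)) / total
        for c in range(n_classes)
    ]
    return max(range(n_classes), key=averaged.__getitem__), averaged


# Hypothetical rankings from three selectors over four made-up gene names.
rankings = [
    ["g1", "g2", "g3", "g4"],
    ["g2", "g1", "g4", "g3"],
    ["g2", "g3", "g1", "g4"],
]
selected = aggregate_ranks(rankings, top_k=2)

# Hypothetical class probabilities from three classifiers (for example SVM,
# k-NN, and a decision tree) with illustrative weights 2, 1, 1.
label, averaged = weighted_vote(
    [[0.6, 0.4], [0.3, 0.7], [0.2, 0.8]],
    weights=[2, 1, 1],
)
```

In this toy input the most heavily weighted classifier favours class 0, yet the weighted average still selects class 1 because the other two classifiers agree with higher confidence; this is the behaviour a weighted average vote is meant to provide.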

Details

Language :
English
ISSN :
2073-4425
Volume :
14
Issue :
9
Database :
MEDLINE
Journal :
Genes
Publication Type :
Academic Journal
Accession number :
37761941
Full Text :
https://doi.org/10.3390/genes14091802