Back to Search Start Over

Can-CSC-GBE: Developing Cost-sensitive Classifier with Gentleboost Ensemble for breast cancer classification using protein amino acids and imbalanced data

Authors :
Abdul Majid
Safdar Ali
Syed Gibran Javed
Mohsin Sattar
Source :
Computers in biology and medicine. 73
Publication Year :
2015

Abstract

Early prediction of breast cancer is important for effective treatment and survival. We developed an effective Cost-Sensitive Classifier with GentleBoost Ensemble (Can-CSC-GBE) for the classification of breast cancer using protein amino acid features. In this work, first, discriminant information of the protein sequences related to breast tissue is extracted. Then, the physicochemical properties hydrophobicity and hydrophilicity of amino acids are employed to generate molecule descriptors in different feature spaces. For comparison, we obtained results by combining Cost-Sensitive learning with conventional ensemble of AdaBoostM1 and Bagging. The proposed Can-CSC-GBE system has effectively reduced the misclassification costs and thereby improved the overall classification performance. Our novel approach has highlighted promising results as compared to the state-of-the-art ensemble approaches.

Details

ISSN :
18790534
Volume :
73
Database :
OpenAIRE
Journal :
Computers in biology and medicine
Accession number :
edsair.doi.dedup.....196e901a2ccd395e18468c36749ac201