A feature engineering-based machine learning technique to detect and classify lung and colon cancer from histopathological images.

Authors :
Chhillar, Indu
Singh, Ajmer
Source :
Medical & Biological Engineering & Computing. Mar2024, Vol. 62 Issue 3, p913-924. 12p.
Publication Year :
2024

Abstract

Globally, lung and colon cancers are among the most prevalent and lethal tumors. Early cancer identification is essential to increase the likelihood of survival. Histopathological images are considered an appropriate tool for diagnosing cancer, but examining them manually is tedious and error-prone. Recently, machine learning methods based on feature engineering have gained prominence in automatic histopathological image classification. Furthermore, these methods are more interpretable than deep learning, which operates in a "black box" manner. In the medical profession, the interpretability of a technique is critical to gaining the trust of end users to adopt it. In view of the above, this work aims to create an accurate and interpretable machine-learning technique for the automated classification of lung and colon cancers from histopathology images. In the proposed approach, following the preprocessing steps, texture and color features are extracted using the Haralick and color histogram feature extraction algorithms, respectively. The obtained features are concatenated to form a single combined feature set. Each of the three feature sets (texture, color, and combined) is passed to the Light Gradient Boosting Machine (LightGBM) classifier, and performance is evaluated on the LC25000 dataset using hold-out and stratified 10-fold cross-validation (Stratified 10-FCV) techniques. On the test/hold-out set, LightGBM with texture, color, and combined features classifies the lung and colon cancer images with 97.72%, 99.92%, and 100% accuracy, respectively. Stratified 10-fold cross-validation likewise showed that LightGBM with the color or combined features performed well, with an excellent mean auc_mu score and a low mean multi_logloss value. Thus, the proposed technique can help histologists detect and classify lung and colon histopathology images more efficiently, effectively, and economically, improving productivity.
[ABSTRACT FROM AUTHOR]
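
The feature-construction step described in the abstract (texture features plus a color histogram, concatenated into one vector) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation, which this record does not include. The function names, the 8-bin histogram, the 8 gray levels, the single horizontal pixel offset, and the choice of four Haralick-style statistics (contrast, angular second moment, inverse difference moment, entropy) are all assumptions made for illustration; the full Haralick set has more descriptors and is usually averaged over several directions. The classifier stage (LightGBM) is left out.

```python
import numpy as np


def color_histogram(image, bins=8):
    """Per-channel intensity histogram, each channel normalized to sum to 1.

    `bins=8` is an illustrative choice, not a value taken from the paper.
    """
    feats = []
    for c in range(image.shape[2]):
        hist, _ = np.histogram(image[:, :, c], bins=bins, range=(0, 256))
        feats.append(hist / hist.sum())
    return np.concatenate(feats)


def glcm(gray, levels=8, offset=(0, 1)):
    """Symmetric, normalized gray-level co-occurrence matrix.

    Assumes pixel values in 0..255 and a non-negative (row, col) offset.
    """
    q = (gray.astype(np.int64) * levels) // 256  # quantize to 0..levels-1
    dr, dc = offset
    rows, cols = q.shape
    a = q[: rows - dr, : cols - dc]  # reference pixels
    b = q[dr:, dc:]                  # neighbors at the given offset
    m = np.zeros((levels, levels), dtype=np.float64)
    np.add.at(m, (a.ravel(), b.ravel()), 1.0)  # count co-occurring pairs
    m = m + m.T                                # count pairs in both directions
    return m / m.sum()


def texture_features(gray, levels=8):
    """Four Haralick-style statistics computed from one co-occurrence matrix."""
    p = glcm(gray, levels)
    i, j = np.indices(p.shape)
    contrast = np.sum(p * (i - j) ** 2)
    asm = np.sum(p ** 2)                       # angular second moment
    idm = np.sum(p / (1.0 + (i - j) ** 2))     # inverse difference moment
    nz = p[p > 0]
    entropy = -np.sum(nz * np.log2(nz))
    return np.array([contrast, asm, idm, entropy])


def combined_features(image):
    """Concatenate texture and color features into a single vector."""
    gray = image.mean(axis=2)  # simple channel average as grayscale
    return np.concatenate([texture_features(gray), color_histogram(image)])


# Synthetic stand-in for a histopathology tile (random pixels, not real data).
rng = np.random.default_rng(0)
image = rng.integers(0, 256, size=(64, 64, 3), dtype=np.uint8)
features = combined_features(image)
print(features.shape)  # (28,): 4 texture values + 3 channels x 8 bins
```

A vector like `features`, computed for every image, would form one row of the matrix handed to the classifier; evaluating the texture, color, and combined variants separately, as the abstract describes, amounts to training on different column subsets of that matrix.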

Details

Language :
English
ISSN :
01400118
Volume :
62
Issue :
3
Database :
Academic Search Index
Journal :
Medical & Biological Engineering & Computing
Publication Type :
Academic Journal
Accession number :
175566381
Full Text :
https://doi.org/10.1007/s11517-023-02984-y