Back to Search Start Over

Optimizing breast cancer diagnosis: combining hybrid architectures through Apache Spark.

Authors :
Taib, Chaymae
Abdoun, Otman
Haimoudi, El Khatir
Source :
International Journal of Electrical & Computer Engineering (2088-8708); Aug2024, Vol. 14 Issue 4, p4261-4272, 12p
Publication Year :
2024

Abstract

Early detection and diagnosis of breast cancer are critical for saving lives. This paper addresses two major challenges associated with this task: the vast amount of data processing involved and the need for early detection of breast cancer. To tackle these issues, we developed thirty hybrid architectures by combining five deep learning techniques (Xception, Inception-V3, ResNet50, VGG16, VGG19) as feature extractors and six classifiers (random forest, logistic regression, naive Bayes, gradient-boosted tree, decision tree, and support vector machine) implemented on the Spark framework. We evaluated the performance of these architectures using four classification criteria. The results, analyzed using Scott Knott's statistical test, demonstrated the effectiveness of merging deep learning feature extraction techniques with traditional classifiers for classifying breast cancer into malignant and benign tumors. Notably, the hybrid architecture using logistic regression as the classifier and ResNet50 for feature extraction (RESLR) emerged as the top performer. It achieved impressive accuracy scores of 98.20%, 96.59%, 96.64%, and 94.84% across the Break-His dataset at different magnifications (40X, 100X, 200X, and 400X) respectively. Additionally, RESLR achieved an accuracy of 97.05% on the ICIAR dataset and a remarkable accuracy of 95.31% on the FNAC dataset. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
20888708
Volume :
14
Issue :
4
Database :
Complementary Index
Journal :
International Journal of Electrical & Computer Engineering (2088-8708)
Publication Type :
Academic Journal
Accession number :
178843318
Full Text :
https://doi.org/10.11591/ijece.v14i4.pp4261-4272