Back to Search
Start Over
Optimizing breast cancer diagnosis: combining hybrid architectures through Apache Spark.
- Source :
- International Journal of Electrical & Computer Engineering (2088-8708); Aug2024, Vol. 14 Issue 4, p4261-4272, 12p
- Publication Year :
- 2024
-
Abstract
- Early detection and diagnosis of breast cancer are critical for saving lives. This paper addresses two major challenges associated with this task: the vast amount of data processing involved and the need for early detection of breast cancer. To tackle these issues, we developed thirty hybrid architectures by combining five deep learning techniques (Xception, Inception-V3, ResNet50, VGG16, VGG19) as feature extractors and six classifiers (random forest, logistic regression, naive Bayes, gradient-boosted tree, decision tree, and support vector machine) implemented on the Spark framework. We evaluated the performance of these architectures using four classification criteria. The results, analyzed using Scott Knott's statistical test, demonstrated the effectiveness of merging deep learning feature extraction techniques with traditional classifiers for classifying breast cancer into malignant and benign tumors. Notably, the hybrid architecture using logistic regression as the classifier and ResNet50 for feature extraction (RESLR) emerged as the top performer. It achieved impressive accuracy scores of 98.20%, 96.59%, 96.64%, and 94.84% across the Break-His dataset at different magnifications (40X, 100X, 200X, and 400X) respectively. Additionally, RESLR achieved an accuracy of 97.05% on the ICIAR dataset and a remarkable accuracy of 95.31% on the FNAC dataset. [ABSTRACT FROM AUTHOR]
Details
- Language :
- English
- ISSN :
- 20888708
- Volume :
- 14
- Issue :
- 4
- Database :
- Complementary Index
- Journal :
- International Journal of Electrical & Computer Engineering (2088-8708)
- Publication Type :
- Academic Journal
- Accession number :
- 178843318
- Full Text :
- https://doi.org/10.11591/ijece.v14i4.pp4261-4272