33 results on '"Breast cancer prediction"'
Search Results
2. Modeling and Predictive Analytics of Breast Cancer Using Ensemble Learning Techniques: An Explainable Artificial Intelligence Approach.
- Author
-
Raha, Avi Deb, Dihan, Fatema Jannat, Gain, Mrityunjoy, Murad, Saydul Akbar, Adhikary, Apurba, Hossain, Md. Bipul, Hassan, Md. Mehedi, Al-Shehari, Taher, Alsadhan, Nasser A., Kadrie, Mohammed, and Bairagi, Anupam Kumar
- Subjects
MACHINE learning ,ENSEMBLE learning ,ARTIFICIAL intelligence ,RANDOM forest algorithms ,BREAST cancer ,BREAST - Abstract
Breast cancer stands as one of the world's most perilous and formidable diseases, having recently surpassed lung cancer as the most prevalent cancer type. This disease arises when cells in the breast undergo unregulated proliferation, resulting in the formation of a tumor that has the capacity to invade surrounding tissues. It is not confined to a specific gender; both men and women can be diagnosed with breast cancer, although it is more frequently observed in women. Early detection is pivotal in mitigating its mortality rate. The key to curbing its mortality lies in early detection. However, it is crucial to explain the black-box machine learning algorithms in this field to gain the trust of medical professionals and patients. In this study, we experimented with various machine learning models to predict breast cancer using the Wisconsin Breast Cancer Dataset (WBCD) dataset. We applied Random Forest, XGBoost, Support Vector Machine (SVM), Multi-Layer Perceptron (MLP), and Gradient Boost classifiers, with the Random Forest model outperforming the others. A comparison analysis between the two methods was done after performing hyperparameter tuning on each method. The analysis showed that the random forest performs better and yields the highest result with 99.46% accuracy. After performance evaluation, two Explainable Artificial Intelligence (XAI) methods, SHapley Additive exPlanations (SHAP) and Local Interpretable Model-Agnostic Explanations (LIME), have been utilized to explain the random forest machine learning model. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
3. An enhanced soft-computing based strategy for efficient feature selection for timely breast cancer prediction: Wisconsin Diagnostic Breast Cancer dataset case.
- Author
-
Singh, Law Kumar, Khanna, Munish, and Singh, Rekha
- Subjects
COMPUTER-aided diagnosis ,OPTIMIZATION algorithms ,FEATURE selection ,METAHEURISTIC algorithms ,MEDICAL personnel - Abstract
When contemplating the improvement of overall performance in machine learning (ML) models, a critical strategy for optimizing data preparation is feature selection (FS). There has been a significant rise in the popularity of metaheuristic FS algorithms in recent times. This can be attributed to their proficiency in accurately identifying and selecting the most relevant features for ML tasks. This study presents three feature selection strategies that utilize metaheuristic algorithms. The methodologies mentioned include the Gravitational Search Optimization Algorithm (GSA), Emperor Penguin Optimization (EPO), and a hybrid approach of GSA and EPO referred to as hGSAEPO. Previous research has explored the use of baseline algorithms for feature selection in various ML tasks. However, there is a lack of investigation regarding their application specifically in breast cancer(BC) classification. A combination of these two has been utilized for the first occasion. The purpose of selecting BC as the study of investigation is due to the reason that this illness is recognized as the second most prevalent cause of mortality in the female population. If the condition is detected in its initial phases, it can be remedied and can assist individuals in evading superfluous medical processes. The procedure of selecting relevant features holds significant importance in the purpose of predicting ailments like BC. The current research presents an innovative methodology that employs three soft-computing algorithms, EPO, GSA, and their proposed hybrid hGSAEPO to efficiently identify significant features while concurrently decreasing the occurrence of irrelevant ones, simplifying overall complexity and enhancing the accuracy. The utilization of these soft computing methodologies and six ML classifiers presents a viable framework for prognostic research through the classification of data instances on Wisconsin Diagnostic Breast Cancer (WDBC). The experimental findings of eight experiments conducted suggest that the suggested approach exhibits exceptional performance in the context of binary classification for BC by computing astounding results like precision of 0.9800, sensitivity of 0.9700, specificity of 0.9887, F1-score of 0.9539, area under the curve(AUC) surpassing 0.998, with an accuracy of 98.31%. We achieved our objectives by presenting a dependable clinical prediction system for healthcare professionals for efficient diagnosis. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
4. An Enhanced Feature Selection Integrated Explainable Machine Learning Models for Prediction of Breast Cancers.
- Author
-
Reddy, Samreddy Pooja and Deepa, K.
- Subjects
MACHINE learning ,CLINICAL decision support systems ,FEATURE selection ,BREAST cancer ,PREDICTION models - Abstract
The prediction of breast cancer has evolved significantly with the advent of machine learning models, which offer promising tools for early diagnosis and personalized treatment planning. This systematic literature review examines 31 peer-reviewed analyses undertaken between 2019 and 2024, targeting on the integration of enhanced feature selection methods within explainable machine learning models for breast cancer prediction. The primary goal is to assess how feature selection techniques contribute to model accuracy and interpretability, ensuring that predictions are both reliable and understandable to clinicians and researchers. The reviewed studies highlight a trend towards the use of sophisticated feature selection algorithms that refine input data, yielding models that not only enhance prediction accuracy but also generate actionable insights to inform decisions. This is especially important in healthcare, where the explainability of AI models can enhance trust and adoption by medical professionals, potentially improving patient outcomes. Despite these advancements, the review identifies several challenges, including the need for large, diverse datasets to ensure model generalizability and the difficulty of balancing model complexity with interpretability. Furthermore, the integration of these models into clinical workflows remains a significant hurdle due to varying levels of transparency and the potential for bias in feature selection. Gaps in the current literature suggest that future research concentrate on creating developing standardized frameworks for integrating feature selection with explainable models, ensuring that these tools are both effective and widely applicable in clinical settings. This review aims to guide future research and development efforts towards creating more robust, transparent, and clinically useful predictive models for breast cancer, ultimately contributing to more precise and personalized healthcare solutions. [ABSTRACT FROM AUTHOR]
- Published
- 2024
5. Breast Cancer Prediction Using Hybridization of Machine Learning and Optimization Technique
- Author
-
Mittal, Ayushi, Gupta, Charu, Tayal, Devendra Kumar, Kacprzyk, Janusz, Series Editor, Gomide, Fernando, Advisory Editor, Kaynak, Okyay, Advisory Editor, Liu, Derong, Advisory Editor, Pedrycz, Witold, Advisory Editor, Polycarpou, Marios M., Advisory Editor, Rudas, Imre J., Advisory Editor, Wang, Jun, Advisory Editor, Hassanien, Aboul Ella, editor, Anand, Sameer, editor, Jaiswal, Ajay, editor, and Kumar, Prabhat, editor
- Published
- 2024
- Full Text
- View/download PDF
6. Breast Cancer Detection: An Evaluation of Machine Learning, Ensemble Learning, and Deep Learning Algorithms
- Author
-
Rai, Deepak, Mishra, Tripti, Filipe, Joaquim, Editorial Board Member, Ghosh, Ashish, Editorial Board Member, Zhou, Lizhu, Editorial Board Member, Chauhan, Naveen, editor, Yadav, Divakar, editor, Verma, Gyanendra K., editor, Soni, Badal, editor, and Lara, Jorge Morato, editor
- Published
- 2024
- Full Text
- View/download PDF
7. Breast Cancer Detection Using Optimal Machine Learning Techniques: Uncovering the Most Effective Approach
- Author
-
Joshi, Tanmay, Hegadi, Ravindra, Filipe, Joaquim, Editorial Board Member, Ghosh, Ashish, Editorial Board Member, Prates, Raquel Oliveira, Editorial Board Member, Zhou, Lizhu, Editorial Board Member, Santosh, KC, editor, Makkar, Aaisha, editor, Conway, Myra, editor, Singh, Ashutosh K., editor, Vacavant, Antoine, editor, Abou el Kalam, Anas, editor, Bouguelia, Mohamed-Rafik, editor, and Hegadi, Ravindra, editor
- Published
- 2024
- Full Text
- View/download PDF
8. An Enhancement in Accuracy for Breast Cancer Prediction Using Machine Learning and Deep Learning Model
- Author
-
Panda, Subham, Kumar, Bagesh, Kumar, Chandan, Sharma, Vaidik, Bhardwaj, Akash, Gautam, Shubhendra, Kumar, Vishal, Vyas, O. P., Das, Swagatam, Series Editor, Bansal, Jagdish Chand, Series Editor, Tavares, João Manuel R. S., editor, Rodrigues, Joel J. P. C., editor, Misra, Debajyoti, editor, and Bhattacherjee, Debasmriti, editor
- Published
- 2024
- Full Text
- View/download PDF
9. Predictive modeling for breast cancer classification in the context of Bangladeshi patients by use of machine learning approach with explainable AI
- Author
-
Taminul Islam, Md. Alif Sheakh, Mst. Sazia Tahosin, Most. Hasna Hena, Shopnil Akash, Yousef A. Bin Jardan, Gezahign FentahunWondmie, Hiba-Allah Nafidi, and Mohammed Bourhia
- Subjects
Breast cancer prediction ,Machine learning ,Cancer prediction ,Hyperparameter tuning ,Explainable AI ,Medicine ,Science - Abstract
Abstract Breast cancer has rapidly increased in prevalence in recent years, making it one of the leading causes of mortality worldwide. Among all cancers, it is by far the most common. Diagnosing this illness manually requires significant time and expertise. Since detecting breast cancer is a time-consuming process, preventing its further spread can be aided by creating machine-based forecasts. Machine learning and Explainable AI are crucial in classification as they not only provide accurate predictions but also offer insights into how the model arrives at its decisions, aiding in the understanding and trustworthiness of the classification results. In this study, we evaluate and compare the classification accuracy, precision, recall, and F1 scores of five different machine learning methods using a primary dataset (500 patients from Dhaka Medical College Hospital). Five different supervised machine learning techniques, including decision tree, random forest, logistic regression, naive bayes, and XGBoost, have been used to achieve optimal results on our dataset. Additionally, this study applied SHAP analysis to the XGBoost model to interpret the model’s predictions and understand the impact of each feature on the model’s output. We compared the accuracy with which several algorithms classified the data, as well as contrasted with other literature in this field. After final evaluation, this study found that XGBoost achieved the best model accuracy, which is 97%.
- Published
- 2024
- Full Text
- View/download PDF
10. A light gradient boosting machine learning-based approach for predicting clinical data breast cancer
- Author
-
Qiuqian, Wang, GaoMin, KeZhu, Zhang, and Chenchen
- Published
- 2025
- Full Text
- View/download PDF
11. Predictive modeling for breast cancer classification in the context of Bangladeshi patients by use of machine learning approach with explainable AI.
- Author
-
Islam, Taminul, Sheakh, Md. Alif, Tahosin, Mst. Sazia, Hena, Most. Hasna, Akash, Shopnil, Bin Jardan, Yousef A., FentahunWondmie, Gezahign, Nafidi, Hiba-Allah, and Bourhia, Mohammed
- Subjects
MACHINE learning ,TUMOR classification ,PREDICTION models ,BREAST cancer ,ARTIFICIAL intelligence - Abstract
Breast cancer has rapidly increased in prevalence in recent years, making it one of the leading causes of mortality worldwide. Among all cancers, it is by far the most common. Diagnosing this illness manually requires significant time and expertise. Since detecting breast cancer is a time-consuming process, preventing its further spread can be aided by creating machine-based forecasts. Machine learning and Explainable AI are crucial in classification as they not only provide accurate predictions but also offer insights into how the model arrives at its decisions, aiding in the understanding and trustworthiness of the classification results. In this study, we evaluate and compare the classification accuracy, precision, recall, and F1 scores of five different machine learning methods using a primary dataset (500 patients from Dhaka Medical College Hospital). Five different supervised machine learning techniques, including decision tree, random forest, logistic regression, naive bayes, and XGBoost, have been used to achieve optimal results on our dataset. Additionally, this study applied SHAP analysis to the XGBoost model to interpret the model's predictions and understand the impact of each feature on the model's output. We compared the accuracy with which several algorithms classified the data, as well as contrasted with other literature in this field. After final evaluation, this study found that XGBoost achieved the best model accuracy, which is 97%. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
12. Hybrid LightGBM Model for Breast Cancer Diagnosis.
- Author
-
XING Changzheng and XU Jiayu
- Abstract
Breast cancer is one of the most common types of cancer, and its prevalence continues to rise every year. Without surgical biopsy, it can effectively provide auxiliary diagnosis and treatment for doctors and reduce the pain of patients by analyzing various indicators of the nucleus to predict whether the mass is benign or not. Therefore, a breast cancer diagnosis model based on LightGBM algorithm is proposed. Firstly, the borderline- synthetic minority oversampling technique (Borderline-SMOTE) is used to improve the problem of imbalanced breast cancer diagnosis data. Secondly, the PWLCM chaotic map, the new inertia weight and the criss-cross algorithm are introduced into the sparrow search algorithm (SSA) to improve it, and then the improved SSA algorithm is used to automatically optimize the parameters of LightGBM. Then, because LightGBM is sensitive to noise, an OVR-Jacobian regularization method is proposed to reduce the noise of LightGBM. Finally, the improved LightGBM hybrid model is used to diagnose breast cancer. The experimental results show that the proposed hybrid model is superior to the common models in the three indicators of mean square error, coefficient of determination and cross-validation score, showing its better diagnostic effect. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
13. On the Effectiveness of Dimensionality Reduction Techniques on High Dimensionality Datasets
- Author
-
Henouda, Salah Eddine, Laallam, Fatima Zohra, Kazar, Okba, Harous, Saad, Houfani, Djihane, Kacprzyk, Janusz, Series Editor, Gomide, Fernando, Advisory Editor, Kaynak, Okyay, Advisory Editor, Liu, Derong, Advisory Editor, Pedrycz, Witold, Advisory Editor, Polycarpou, Marios M., Advisory Editor, Rudas, Imre J., Advisory Editor, Wang, Jun, Advisory Editor, Laouar, Mohamed Ridda, editor, Balas, Valentina Emilia, editor, Lejdel, Brahim, editor, Eom, Sean, editor, and Boudia, Mohamed Amine, editor
- Published
- 2023
- Full Text
- View/download PDF
14. Correlation-based feature selection and Smote-Tomek Link to improve the performance of machine learning methods on cancer disease prediction.
- Author
-
Putra, Lalu Ganda Rady, Marzuki, Khairan, and Hairani, Hairani
- Subjects
MACHINE learning ,NAIVE Bayes classification ,MACHINE performance ,EARLY detection of cancer ,FEATURE selection ,CANCER cell growth ,ARCHIPELAGOES - Abstract
Indonesia is an archipelago with the fourth largest population in the world, with a population of 283 million. In Indonesia, breast cancer ranks first in cancer and is the highest contributor to death. Deaths caused by breast cancer can be minimized by screening and early detection to avoid the risk of more severe cancer. Early detection of breast cancer can delay the growth of cancer cells and increase the chances of recovery. This research proposed a machine learning-based application for screening and early detection of breast cancer independently based on perceived symptoms. However, developing breast cancer early detection applications requires a very high level of accuracy to minimize prediction errors. This research focused on finding the optimal accuracy of the machine learning method so that it could predict breast cancer with a very low error rate. This research aimed to improve the performance of classification methods in breast cancer disease prediction using the correlation feature selection approach and hybrid sampling Smote-Tomek Link. This research utilized Support Vector Machine (SVM) and Naive Bayes classification methods with a combination of Smote-Tomek Link hybrid sampling approach and correlation feature selection. Hybrid Sampling Smote-Tomek Link balanced the data by minimizing noise in the data created. At the same time, the correlation feature selection method was used to select relevant or influential attributes with class attributes based on a strong correlation level (= 0.6) between input attributes and classes. The results of this study obtained that the SVM method with hybrid sampling and correlation feature selection obtained the best performance compared to the Naive Bayes method and previous research referred to with an accuracy of 96.80%, sensitivity of 96.80%, and specificity of 96.80%. Thus, using the Smote-Tomek Link hybrid sampling approach and correlation feature selection positively impacted the performance increase in the SVM and Naive Bayes methods for breast cancer prediction. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
15. Explainable extreme boosting model for breast cancer diagnosis.
- Author
-
Suresh, Tamilarasi, Assegie, Tsehay Admassu, Ganesan, Sangeetha, Tulasi, Ravulapalli Lakshmi, Mothukuri, Radha, and Salau, Ayodeji Olalekan
- Subjects
CANCER diagnosis ,BREAST ,BREAST cancer - Abstract
This study investigates the Shapley additive explanation (SHAP) of the extreme boosting (XGBoost) model for breast cancer diagnosis. The study employed Wisconsin’s breast cancer dataset, characterized by 30 features extracted from an image of a breast cell. SHAP module generated different explainer values representing the impact of a breast cancer feature on breast cancer diagnosis. The experiment computed SHAP values of 569 samples of the breast cancer dataset. The SHAP explanation indicates perimeter and concave points have the highest impact on breast cancer diagnosis. SHAP explains the XGB model diagnosis outcome showing the features affecting the XGBoost model. The developed XGB model achieves an accuracy of 98.42% [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
16. An optimized efficient combinatorial learning using deep neural network and statistical techniques.
- Author
-
V K, Jyothi and Sarma, Guda Ramachandra Kaladhara
- Subjects
- *
ARTIFICIAL neural networks , *ARTIFICIAL intelligence , *DEEP learning , *MACHINE learning , *HEART diseases , *MEDICAL statistics - Abstract
Research work is to discover the rapid requirement of Artificial Intelligence and Statistics in medical research. Objective is to design a diagnostic prediction system that can detect and predict diseases at an early stage from clinical data sets. Some of major diseases leading reasons of death globally are heart disease and cancer. There are different kinds of cancer, in this study we focused on breast cancer and heart disease. Prediction of these diseases at a very early stage is curable and preventive diagnosis can control death rate. Designed two Artificial Intelligence systems for prediction of above-mentioned diseases using statistics and Deep neural networks (i) Combinatorial Learning (CLSDnn) and (ii) an optimized efficient Combinatorial Learning (eCLSDnn). To evaluate the performance of the proposed system conducted experiments on three different data sets, in which two data sets are of breast cancer namely, Wisconsin-data set of UCI Machine Learning repository and AI for Social Good: Women Coders' Bootcamp data set and Cleveland heart disease data set of UCI Machine Learning repository. The proposed architectures of binary classification are validated for 70%–30% data splitting and on K-fold cross validation. Recognition of Malignant cancerous tumors CLSDnn model achieved maximum accuracy of 98.53% for Wisconsin data set, 95.32% for AI for Social Good: Women Coders' data set and 96.72% for Cleveland data set. Recognition of Malignant cancerous tumors eCLSDnn model achieved 99.36% for Wisconsin data set, 97.12% for AI for Social Good: Women Coders' data set and 99.56% for the Cleveland heart disease data set. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
17. Tackling bias in the data for breast cancer prediction using machine learning-based decision support.
- Author
-
Yin, Shuning, Nanda, Gaurav, and Sundararajan, Raji
- Subjects
- *
MACHINE learning , *BREAST cancer , *FORECASTING , *LOGISTIC regression analysis , *CANCER patients - Abstract
In this study, a machine learning (ML)-based decision support approach was developed to identify breast cancer likelihood in patients, based on their background and physiological data. Two ML models, Naïve Bayes and Logistic Regression were used to evaluate the Breast Cancer Surveillance Consortium dataset that had about 9:1 ratio of non-cancer cases ('Class 0') to cancer cases ('Class 1'). We manually built both balanced and unbalanced training datasets and a non-overlapping testing dataset using a stratified sampling method. For each model, we partitioned the prediction results on testing set into two groups, the 'Agree' group included cases where balanced and unbalanced ML predictions agreed, and the remaining cases come under 'Disagree' group. Sensitivity and Positive Predictive Value were used as the prediction performance measures. For Naïve Bayes, the sensitivity of Class 1 in regular versus 'Agree' group increased from 0.687 to 0.936 and for Logistic Regression, it increased from 0.358 to 0.8306. This indicates the 'Agree' group predictions were more accurate and could be labeled as high-confidence ML predictions. The 'Agree' group consisted of 89% cases in the testing set, so the improved prediction performance was applicable for a large portion of the testing dataset. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
18. Breast Cancer Prediction Using Fine Needle Aspiration Features and Upsampling with Supervised Machine Learning.
- Author
-
Shafique, Rahman, Rustam, Furqan, Choi, Gyu Sang, Díez, Isabel de la Torre, Mahmood, Arif, Lipari, Vivian, Velasco, Carmen Lili Rodríguez, and Ashraf, Imran
- Subjects
- *
MACHINE learning , *RESEARCH funding , *BREAST tumors , *NEEDLE biopsy - Abstract
Simple Summary: Breast cancer is prevalent in women and the second leading cause of death. Conventional breast cancer detection methods require several laboratory tests and medical experts. Automated breast cancer detection is thus very important for timely treatment. This study explores the influence of various feature selection technique to increase the performance of machine learning methods for breast cancer detection. Experimental results shows that use of appropriate features tend to show highly accurate prediction. Breast cancer is one of the most common invasive cancers in women and it continues to be a worldwide medical problem since the number of cases has significantly increased over the past decade. Breast cancer is the second leading cause of death from cancer in women. The early detection of breast cancer can save human life but the traditional approach for detecting breast cancer disease needs various laboratory tests involving medical experts. To reduce human error and speed up breast cancer detection, an automatic system is required that would perform the diagnosis accurately and timely. Despite the research efforts for automated systems for cancer detection, a wide gap exists between the desired and provided accuracy of current approaches. To overcome this issue, this research proposes an approach for breast cancer prediction by selecting the best fine needle aspiration features. To enhance the prediction accuracy, several feature selection techniques are applied to analyze their efficacy, such as principal component analysis, singular vector decomposition, and chi-square (Chi2). Extensive experiments are performed with different features and different set sizes of features to investigate the optimal feature set. Additionally, the influence of imbalanced and balanced data using the SMOTE approach is investigated. Six classifiers including random forest, support vector machine, gradient boosting machine, logistic regression, multilayer perceptron, and K-nearest neighbors (KNN) are tuned to achieve increased classification accuracy. Results indicate that KNN outperforms all other classifiers on the used dataset with 20 features using SVD and with the 15 most important features using a PCA with a 100% accuracy score. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
19. Accuracy Assessment of Machine Learning Algorithms Used to Predict Breast Cancer.
- Author
-
Ebrahim, Mohamed, Sedky, Ahmed Ahmed Hesham, and Mesbah, Saleh
- Subjects
MACHINE learning ,FEATURE selection ,RECURRENT neural networks ,BREAST cancer ,SUPPORT vector machines - Abstract
Machine learning (ML) was used to develop classification models to predict individual tumor patients' outcomes. Binary classification defined whether the tumor was malignant or benign. This paper presents a comparative analysis of machine learning algorithms used for breast cancer prediction. This study used a dataset obtained from the National Cancer Institute (NIH), USA, which contains 1.7 million data records. Classical and deep learning methods were included in the accuracy assessment. Classical decision tree (DT), linear discriminant (LD), logistic regression (LR), support vector machine (SVM), and ensemble techniques (ET) algorithms were used. Probabilistic neural network (PNN), deep neural network (DNN), and recurrent neural network (RNN) methods were used for comparison. Feature selection and its effect on accuracy were also investigated. The results showed that decision trees and ensemble techniques outperformed the other techniques, as they both achieved a 98.7% accuracy. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
20. Breast Cancer Detection Using Convoluted Features and Ensemble Machine Learning Algorithm.
- Author
-
Umer, Muhammad, Naveed, Mahum, Alrowais, Fadwa, Ishaq, Abid, Hejaili, Abdullah Al, Alsubai, Shtwai, Eshmawi, Ala' Abdulmajid, Mohamed, Abdullah, and Ashraf, Imran
- Subjects
- *
BREAST tumor diagnosis , *RESEARCH methodology , *MACHINE learning , *EARLY detection of cancer , *CANCER patients , *BREAST , *DESCRIPTIVE statistics , *ARTIFICIAL neural networks , *SENSITIVITY & specificity (Statistics) , *LOGISTIC regression analysis , *ALGORITHMS , *BREAST tumors - Abstract
Simple Summary: This paper presents a breast cancer detection approach where the convoluted features from a convolutional neural network are utilized to train a machine learning model. Results demonstrate that use of convoluted features yields better results than the original features to classify malignant and benign tumors. Breast cancer is a common cause of female mortality in developing countries. Screening and early diagnosis can play an important role in the prevention and treatment of these cancers. This study proposes an ensemble learning-based voting classifier that combines the logistic regression and stochastic gradient descent classifier with deep convoluted features for the accurate detection of cancerous patients. Deep convoluted features are extracted from the microscopic features and fed to the ensemble voting classifier. This idea provides an optimized framework that accurately classifies malignant and benign tumors with improved accuracy. Results obtained using the voting classifier with convoluted features demonstrate that the highest classification accuracy of 100% is achieved. The proposed approach revealed the accuracy enhancement in comparison with the state-of-the-art approaches. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
21. Breast Cancer Prognosis Using Machine Learning Techniques and Genetic Algorithm: Experiment on Six Different Datasets
- Author
-
Jijitha, S., Amudha, Thangavel, Xhafa, Fatos, Series Editor, Suma, V., editor, Bouhmala, Noureddine, editor, and Wang, Haoxiang, editor
- Published
- 2021
- Full Text
- View/download PDF
22. Explainable Ensemble Machine Learning for Breast Cancer Diagnosis Based on Ultrasound Image Texture Features.
- Author
-
Rezazadeh, Alireza, Jafarian, Yasamin, and Kord, Ali
- Subjects
MACHINE learning ,BREAST cancer diagnosis ,ULTRASONIC imaging ,DECISION trees ,PREDICTION models - Abstract
Image classification is widely used to build predictive models for breast cancer diagnosis. Most existing approaches overwhelmingly rely on deep convolutional networks to build such diagnosis pipelines. These model architectures, although remarkable in performance, are black-box systems that provide minimal insight into the inner logic behind their predictions. This is a major drawback as the explainability of prediction is vital for applications such as cancer diagnosis. In this paper, we address this issue by proposing an explainable machine learning pipeline for breast cancer diagnosis based on ultrasound images. We extract first- and second-order texture features of the ultrasound images and use them to build a probabilistic ensemble of decision tree classifiers. Each decision tree learns to classify the input ultrasound image by learning a set of robust decision thresholds for texture features of the image. The decision path of the model predictions can then be interpreted by decomposing the learned decision trees. Our results show that our proposed framework achieves high predictive performance while being explainable. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
23. Accuracy Assessment of Machine Learning Algorithms Used to Predict Breast Cancer
- Author
-
Mohamed Ebrahim, Ahmed Ahmed Hesham Sedky, and Saleh Mesbah
- Subjects
deep learning ,feature selection ,machine learning ,breast cancer prediction ,tumor classification ,Bibliography. Library science. Information resources - Abstract
Machine learning (ML) was used to develop classification models to predict individual tumor patients’ outcomes. Binary classification defined whether the tumor was malignant or benign. This paper presents a comparative analysis of machine learning algorithms used for breast cancer prediction. This study used a dataset obtained from the National Cancer Institute (NIH), USA, which contains 1.7 million data records. Classical and deep learning methods were included in the accuracy assessment. Classical decision tree (DT), linear discriminant (LD), logistic regression (LR), support vector machine (SVM), and ensemble techniques (ET) algorithms were used. Probabilistic neural network (PNN), deep neural network (DNN), and recurrent neural network (RNN) methods were used for comparison. Feature selection and its effect on accuracy were also investigated. The results showed that decision trees and ensemble techniques outperformed the other techniques, as they both achieved a 98.7% accuracy.
- Published
- 2023
- Full Text
- View/download PDF
24. Prediction of Breast Cancer, Comparative Review of Machine Learning Techniques, and Their Analysis
- Author
-
Noreen Fatima, Li Liu, Sha Hong, and Haroon Ahmed
- Subjects
Machine learning ,breast cancer prediction ,deep learning ,data mining ,ensemble techniques ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
Breast cancer is type of tumor that occurs in the tissues of the breast. It is most common type of cancer found in women around the world and it is among the leading causes of deaths in women. This article presents the comparative analysis of machine learning, deep learning and data mining techniques being used for the prediction of breast cancer. Many researchers have put their efforts on breast cancer diagnoses and prognoses, every technique has different accuracy rate and it varies for different situations, tools and datasets being used. Our main focus is to comparatively analyze different existing Machine Learning and Data Mining techniques in order to find out the most appropriate method that will support the large dataset with good accuracy of prediction. The main purpose of this review is to highlight all the previous studies of machine learning algorithms that are being used for breast cancer prediction and this article provides the all necessary information to the beginners who want to analyze the machine learning algorithms to gain the base of deep learning.
- Published
- 2020
- Full Text
- View/download PDF
25. Pendekatan Machine Learning yang Efisien untuk Prediksi Kanker Payudara
- Author
-
Azminuddin I. S. Azis, Irma Surya Kumala Idris, Budy Santoso, and Yasin Aril Mustofa
- Subjects
machine learning ,breast cancer prediction ,missing value replacement ,feature selection ,unbalanced class ,Systems engineering ,TA168 ,Information technology ,T58.5-58.64 - Abstract
Breast Cancer is the most common cancer found in women and the death rate is still in second place among other cancers. The high accuracy of the machine learning approach that has been proposed by related studies is often achieved. However, without efficient pre-processing, the model of Breast Cancer prediction that was proposed is still in question. Therefore, this research objective to improve the accuracy of machine learning methods through pre-processing: Missing Value Replacement, Data Transformation, Smoothing Noisy Data, Feature Selection / Attribute Weighting, Data Validation, and Unbalanced Class Reduction which is more efficient for Breast Cancer prediction. The results of this study propose several approaches: C4.5 - Z-Score - Genetic Algorithm for Breast Cancer Dataset with 77,27% accuracy, 7-Nearest Neighbor - Min-Max Normalization - Particle Swarm Optimization for Wisconsin Breast Cancer Dataset - Original with 97,85% accuracy, Artificial Neural Network - Z-Score - Forward Selection for Wisconsin Breast Cancer Dataset - Diagnostics with 98,24% accuracy, and 11-Nearest Neighbor - Min-Max Normalization - Particle Swarm Optimization for Wisconsin Breast Cancer Dataset - Prognostic with 83,33% accuracy. The performance of these approaches is better than standard/normal machine learning methods and the proposed methods by the best of previous related studies.
- Published
- 2019
- Full Text
- View/download PDF
26. Intelligent Breast Cancer Prediction Empowered with Fusion and Deep Learning.
- Author
-
Siddiqui, Shahan Yamin, Naseer, Iftikhar, Khan, Muhammad Adnan, Mushtaq, Muhammad Faheem, Naqvi, Rizwan Ali, Hussain, Dildar, and Haider, Amir
- Subjects
DEEP learning ,BREAST cancer ,COMPUTER-aided diagnosis ,MULTIMODAL user interfaces ,COMPUTER-assisted image analysis (Medicine) ,MACHINE learning ,BREAST cancer prognosis - Abstract
Breast cancer is the most frequently detected tumor that eventually could result in a significant increase in female mortality globally. According to clinical statistics, one woman out of eight is under the threat of breast cancer. Lifestyle and inheritance patterns may be a reason behind its spread among women. However, some preventive measures, such as tests and periodic clinical checks can mitigate its risk thereby, improving its survival chances substantially. Early diagnosis and initial stage treatment can help increase the survival rate. For that purpose, pathologists can gather support from nondestructive and efficient computer-aided diagnosis (CAD) systems. This study explores the breast cancer CAD method relying on multimodal medical imaging and decision-based fusion. In multimodal medical imaging fusion, a deep learning approach is applied, obtaining 97.5% accuracy with a 2.5% miss rate for breast cancer prediction. A deep extreme learning machine technique applied on feature-based data provided a 97.41% accuracy. Finally, decision-based fusion applied to both breast cancer prediction models to diagnose its stages, resulted in an overall accuracy of 97.97%. The proposed system model provides more accurate results compared with other state-of-the-art approaches, rapidly diagnosing breast cancer to decrease its mortality rate. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
27. EFFECTIVE DIAGNOSIS OF BREAST CANCER USING KNN ALGORITHM.
- Author
-
PATIL, DEEPALI A., BADARPURA, SHAKIB, JAIN, ABHISHEK, and GUPTA, ANIKET
- Subjects
BREAST cancer diagnosis ,MACHINE learning ,K-nearest neighbor classification ,NEURAL circuitry ,DECISION trees - Abstract
Cancer is one of the deadliest diseases in human beings. Breast cancer is considered to be the second most exposed cancer in the world and is now the most common disease in women. The women of ages 45-59 has the highest number of chances to be affected by breast cancer. Early prediction and diagnosis of breast cancer can prevent its spread and may help with effective treatment or medication. Predicting breast cancer is a very arduous task as the data can be highly Non-linear and may require high level computation modeling. However, many machine learning algorithms like KNN, K-Means, Decision Trees, Neural Networks etc., have proved to be effective in predicting breast cancer. This study shows the use of k-Nearest Neighbors (kNN) algorithm to predict whether a person is having breast cancer or not, using a machine learning model trained with different features. Thus, we inferred that we could predict the Breast Cancer with reasonable accuracy. From the results, it can be concluded that breast cancer cells can be accurately detected using machine learning techniques such as KNN. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
28. Prediction of Breast Cancer, Comparative Review of Machine Learning Techniques, and Their Analysis
- Author
-
Li Liu, Noreen Fatima, Haroon Ahmed, and Sha Hong
- Subjects
General Computer Science ,Computer science ,02 engineering and technology ,Machine learning ,computer.software_genre ,breast cancer prediction ,ensemble techniques ,Breast cancer ,0202 electrical engineering, electronic engineering, information engineering ,medicine ,General Materials Science ,Medical diagnosis ,business.industry ,Deep learning ,General Engineering ,Cancer ,deep learning ,020206 networking & telecommunications ,data mining ,medicine.disease ,ComputingMethodologies_PATTERNRECOGNITION ,020201 artificial intelligence & image processing ,Artificial intelligence ,lcsh:Electrical engineering. Electronics. Nuclear engineering ,business ,computer ,lcsh:TK1-9971 - Abstract
Breast cancer is type of tumor that occurs in the tissues of the breast. It is most common type of cancer found in women around the world and it is among the leading causes of deaths in women. This article presents the comparative analysis of machine learning, deep learning and data mining techniques being used for the prediction of breast cancer. Many researchers have put their efforts on breast cancer diagnoses and prognoses, every technique has different accuracy rate and it varies for different situations, tools and datasets being used. Our main focus is to comparatively analyze different existing Machine Learning and Data Mining techniques in order to find out the most appropriate method that will support the large dataset with good accuracy of prediction. The main purpose of this review is to highlight all the previous studies of machine learning algorithms that are being used for breast cancer prediction and this article provides the all necessary information to the beginners who want to analyze the machine learning algorithms to gain the base of deep learning.
- Published
- 2020
29. Pendekatan Machine Learning yang Efisien untuk Prediksi Kanker Payudara
- Author
-
Irma Surya Kumala Idris, Azminuddin I. S. Azis, Yasin Aril Mustofa, and Budy Santoso
- Subjects
Normalization (statistics) ,Computer science ,Data validation ,Feature selection ,Machine learning ,computer.software_genre ,breast cancer prediction ,lcsh:TA168 ,Breast cancer ,feature selection ,medicine ,Artificial neural network ,lcsh:T58.5-58.64 ,business.industry ,lcsh:Information technology ,Particle swarm optimization ,medicine.disease ,prediksi kanker payudara ,Weighting ,machine learning ,lcsh:Systems engineering ,missing value replacement ,unbalanced class ,machine leraning ,Artificial intelligence ,business ,computer ,Smoothing - Abstract
Breast Cancer is the most common cancer found in women and the death rate is still in second place among other cancers. The high accuracy of the machine learning approach that has been proposed by related studies is often achieved. However, without efficient pre-processing, the model of Breast Cancer prediction that was proposed is still in question. Therefore, this research objective to improve the accuracy of machine learning methods through pre-processing: Missing Value Replacement, Data Transformation, Smoothing Noisy Data, Feature Selection / Attribute Weighting, Data Validation, and Unbalanced Class Reduction which is more efficient for Breast Cancer prediction. The results of this study propose several approaches: C4.5 - Z-Score - Genetic Algorithm for Breast Cancer Dataset with 77,27% accuracy, 7-Nearest Neighbor - Min-Max Normalization - Particle Swarm Optimization for Wisconsin Breast Cancer Dataset - Original with 97,85% accuracy, Artificial Neural Network - Z-Score - Forward Selection for Wisconsin Breast Cancer Dataset - Diagnostics with 98,24% accuracy, and 11-Nearest Neighbor - Min-Max Normalization - Particle Swarm Optimization for Wisconsin Breast Cancer Dataset - Prognostic with 83,33% accuracy. The performance of these approaches is better than standard/normal machine learning methods and the proposed methods by the best of previous related studies.  , Kanker Payudara merupakan kanker yang paling sering ditemukan pada wanita dan tingkat kematiannya masih berada pada posisi dua di antara penyakit kanker laiinya. Akurasi yang tinggi dari pendekatan machine learning yang diusulkan oleh penelitian-penelitian terkait sering dicapai. Namun tanpa pra-pengolahan yang efisien, maka model prediksi Kanker Payudara yang diusulkan masih diragukan. Oleh karena itu, penelitian ini bertujuan untuk meningkatkan kinerja akurasi metode-metode machine learning melalui pra-pengolahan: Missing Value Replacement, Data Transformation, Smoothing Noisy Data, Feature Selection / Attribute Weighting, Data Validation, dan Unbalanced Class Reduction yang lebih efisien untuk prediksi Kanker Payudara. Hasil penelitian ini mengusulkan pendekatan: C4.5 – Z-Score – Genetic Algorithm untuk Breast Cancer Dataset dengan akurasi 77,27%, 7-Nearest Neighbor – Min-Max Normalization – Particle Swarm Optimization untuk Wisconsin Breast Cancer Dataset - Original dengan akurasi 97,85%, Artificial Neural Network – Z-Score – Forward Selection untuk Wisconsin Breast Cancer Dataset – Diagnostic dengan akurasi 98,24%, dan 11-Nearest Neighbor – Min-Max Normalization – Particle Swarm Optimization untuk Wisconsin Breast Cancer Dataset – Prognostic dengan akurasi 83,33%. Kinerja pendekatan-pendekatan tersebut lebih baik dari metode-metode machine learning standar/normal dan metode yang diusulkan penelitian-penelitian terkait sebelumnya yang terbaik.  
- Published
- 2019
30. Breast Cancer Prediction: A Comparative Study Using Machine Learning Techniques
- Author
-
Islam, Md. Milon, Haque, Md. Rezwanul, Iqbal, Hasib, Hasan, Md. Munirul, Hasan, Mahmudul, and Kabir, Muhammad Nomani
- Published
- 2020
- Full Text
- View/download PDF
31. Breast cancer prediction model with decision tree and adaptive boosting
- Author
-
R. Lakshmi Tulasi, N. Komal Kumar, and Tsehay Admassu Assegie
- Subjects
Information Systems and Management ,Boosting (machine learning) ,business.industry ,Computer science ,Adaboost ,Decision tree ,Machine learning ,computer.software_genre ,medicine.disease ,Adaboost algorithm ,ComputingMethodologies_PATTERNRECOGNITION ,Breast cancer ,Artificial Intelligence ,Control and Systems Engineering ,medicine ,Artificial intelligence ,AdaBoost ,Electrical and Electronic Engineering ,business ,Breast cancer prediction ,computer - Abstract
In this study, breast cancer prediction model is proposed with decision tree and adaptive boosting (Adboost). Furthermore, an extensive experimental evaluation of the predictive performance of the proposed model is conducted. The study is conducted on breast cancer dataset collected form the kaggle data repository. The dataset consists of 569 observations of which the 212 or 37.25% are benign or breast cancer negative and 62.74% are malignant or breast cancer positive. The class distribution shows that, the dataset is highly imbalanced and a learning algorithm such as decision tree is biased to the benign observation and results in poor performance on predicting the malignant observation. To improve the performance of the decision tree on the malignant observation, boosting algorithm namely, the adaptive boosting is employed. Finally, the predictive performance of the decision tree and adaptive boosting is analyzed. The analysis on predictive performance of the model on the kaggle breast cancer data repository shows that, adaptive boosting has 92.53% accuracy and the accuracy of decision tree is 88.80%, Overall, the adaboost algorithm performed better than decision tree.
- Published
- 2021
32. Robust edge-based biomarker discovery improves prediction of breast cancer metastasis
- Author
-
Chengwei Lei, Nahim Adnan, and Jianhua Ruan
- Subjects
Support Vector Machine ,Computer science ,Gene regulatory network ,computer.software_genre ,Biochemistry ,Metastasis ,chemistry.chemical_compound ,0302 clinical medicine ,Structural Biology ,Molecular marker ,Gene expression ,Gene Regulatory Networks ,Protein Interaction Maps ,Biomarker discovery ,Neoplasm Metastasis ,lcsh:QH301-705.5 ,0303 health sciences ,Breast cancer metastasis ,Applied Mathematics ,Comparative analysis ,Computer Science Applications ,Random forest ,030220 oncology & carcinogenesis ,Area Under Curve ,lcsh:R858-859.7 ,Biomarker (medicine) ,Female ,DNA microarray ,Breast cancer prediction ,Systems biology ,Feature selection ,Breast Neoplasms ,lcsh:Computer applications to medicine. Medical informatics ,Machine learning ,Network-based classification ,03 medical and health sciences ,Breast cancer ,medicine ,Biomarkers, Tumor ,Humans ,Molecular Biology ,030304 developmental biology ,business.industry ,Research ,Cancer ,medicine.disease ,Support vector machine ,Statistical classification ,Logistic Models ,lcsh:Biology (General) ,chemistry ,ROC Curve ,Sample size determination ,Artificial intelligence ,business ,computer - Abstract
Background The abundance of molecular profiling of breast cancer tissues entailed active research on molecular marker-based early diagnosis of metastasis. Recently there is a surging interest in combining gene expression with gene networks such as protein-protein interaction (PPI) network, gene co-expression (CE) network and pathway information to identify robust and accurate biomarkers for metastasis prediction, reflecting the common belief that cancer is a systems biology disease. However, controversy exists in the literature regarding whether network markers are indeed better features than genes alone for predicting as well as understanding metastasis. We believe much of the existing results may have been biased by the overly complicated prediction algorithms, unfair evaluation, and lack of rigorous statistics. In this study, we propose a simple approach to use network edges as features, based on two types of networks respectively, and compared their prediction power using three classification algorithms and rigorous statistical procedure on one of the largest datasets available. To detect biomarkers that are significant for the prediction and to compare the robustness of different feature types, we propose an unbiased and novel procedure to measure feature importance that eliminates the potential bias from factors such as different sample size, number of features, as well as class distribution. Results Experimental results reveal that edge-based feature types consistently outperformed gene-based feature type in random forest and logistic regression models under all performance evaluation metrics, while the prediction accuracy of edge-based support vector machine (SVM) model was poorer, due to the larger number of edge features compared to gene features and the lack of feature selection in SVM model. Experimental results also show that edge features are much more robust than gene features and the top biomarkers from edge feature types are statistically more significantly enriched in the biological processes that are well known to be related to breast cancer metastasis. Conclusions Overall, this study validates the utility of edge features as biomarkers but also highlights the importance of carefully designed experimental procedures in order to achieve statistically reliable comparison results.
- Published
- 2020
33. KERNEL-BASED NAIVE BAYES CLASSIFIER FOR BREAST CANCER PREDICTION.
- Author
-
NAHAR, JESMIN, CHEN, YI-PING PHOEBE, and ALI, SHAWKAT
- Subjects
- *
TUMOR classification , *CANCER diagnosis , *BREAST cancer , *CANCER patients , *CANCER treatment , *WOMEN'S health , *ALGORITHMS , *MACHINE learning , *DECISION trees - Abstract
The classification of breast cancer patients is of great importance in cancer diagnosis. Most classical cancer classification methods are clinical-based and have limited diagnostic ability. The recent advances in machine learning technique has made a great impact in cancer diagnosis. In this research, we develop a new algorithm: Kernel-Based Naive Bayes (KBNB) to classify breast cancer tumor based on memography data. The performance of the proposed algorithm is compared with that of classical navie bayes algorithm and kernel-based decision tree algorithm C4.5. The proposed algorithm is found to outperform in the both cases. We recommend the proposed algorithm could be used as a tool to classify the breast patient for early cancer diagnosis. [ABSTRACT FROM AUTHOR]
- Published
- 2007
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.