Back to Search Start Over

Early heart disease prediction using feature engineering and machine learning algorithms

Authors :
Mohammed Amine Bouqentar
Oumaima Terrada
Soufiane Hamida
Shawki Saleh
Driss Lamrani
Bouchaib Cherradi
Abdelhadi Raihani
Source :
Heliyon, Vol 10, Iss 19, Pp e38731- (2024)
Publication Year :
2024
Publisher :
Elsevier, 2024.

Abstract

Heart disease is one of the most widespread global health issues, it is the reason behind around 32 % of deaths worldwide every year. The early prediction and diagnosis of heart diseases are critical for effective treatment and sickness management. Despite the efforts of healthcare professionals, cardiovascular surgeons and cardiologists' misdiagnosis and misinterpretation of test results may happen every day. This study addresses the growing global health challenge raised by Cardiovascular Diseases (CVDs), which account for 32 % of all deaths worldwide, according to the World Health Organization (WHO). With the progress of Machine Learning (ML) and Deep Learning (DL) techniques as part of Artificial Intelligence (AI), these technologies have become crucial for predicting and diagnosing CVDs. This research aims to develop an ML system for the early prediction of cardiovascular diseases by choosing one of the powerful existing ML algorithms after a deep comparative analysis of several. To achieve this work, the Cleveland and Statlog heart datasets from international platforms are used in this study to evaluate and validate the system's performance. The Cleveland dataset is categorized and used to train various ML algorithms, including decision tree, random forest, support vector machine, logistic regression, adaptive boosting, and K-nearest neighbors. The performance of each algorithm is assessed based on accuracy, precision, recall, F1 score, and the Area Under the Curve metrics. Hyperparameter tuning approaches have been employed to find the best hyperparameters that reflect the optimal performance of the used algorithms based on different evaluation approaches including 10-fold cross-validation with a 95 % confidence interval. The study's findings highlight the potential of ML in improving the early prediction and diagnosis of cardiovascular diseases. By comparing and analyzing the performance of the applied algorithms on both the Cleveland and Statlog heart datasets, this research contributes to the advancement of ML techniques in the medical field. The developed ML system offers a valuable tool for healthcare professionals in the early prediction and diagnosis of cardiovascular diseases, with implications for the prediction and diagnosis of other diseases as well.

Details

Language :
English
ISSN :
24058440 and 44585373
Volume :
10
Issue :
19
Database :
Directory of Open Access Journals
Journal :
Heliyon
Publication Type :
Academic Journal
Accession number :
edsdoj.80eba87a89e4fd2bf44585373026ae2
Document Type :
article
Full Text :
https://doi.org/10.1016/j.heliyon.2024.e38731