1. Heart disease prediction using ML through enhanced feature engineering with association and correlation analysis.
- Author
-
Lakshmanarao, Annemneedi, Krishna, Thotakura Venkata Sai, Kiran, Tummala Srinivasa Ravi, krishna, Chinta Venkata Murali, Ushanag, Samsani, and Supriya, Nandikolla
- Subjects
HEART diseases ,STATISTICAL correlation ,MACHINE learning ,SUPPORT vector machines ,K-nearest neighbor classification ,CLASSIFICATION algorithms - Abstract
Heart disease remains a prevalent and critical health concern globally. This paper addresses the critical task of heart disease prediction through the utilization of advanced machine learning techniques. Our approach focuses on the enhancement of feature engineering by incorporating a novel integration of association and correlation analyses. A heart disease dataset from Kaggle was used for the experiments. Association analysis was applied to the categorical and binary features in the dataset. Correlation analysis was applied to the numerical features in the dataset. Based on the insights from association analysis and correlation analysis, a new dataset was created with combinations of features. Later, newly created features are integrated with the original dataset, and classification algorithms are applied. Five machine learning (ML) classifiers, namely decision tree, k-nearest neighbors (KNN), random forest, XG-Boost, and support vector machine (SVM), were applied to the final dataset and achieved a good accuracy rate for heart disease detection. By systematically exploring associations and relationships with categorical, binary, and numerical features, this paper unveils innovative insights that contribute to a more comprehensive understanding of the heart disease dataset. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF