1. A Novel Blending Approach for Smoking Status Prediction in Hidden Smokers to Reduce Cardiovascular Disease Risk
- Author
-
Muhammad Ammar, Nadeem Javaid, Nabil Alrajeh, Muhammad Shafiq, and Muhammad Aslam
- Subjects
AlexNet ,borderline-synthetic minority oversampling technique ,cardiovascular disease ,deep learning ,echo state network ,GoogleNet ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
Smoking has a serious complicated impact on Cardiovascular Health (CVH), which makes it an easily controlled risk factor for Cardiovascular Disease (CVD). The importance of early diagnosis and treatment is crucial because of CVD, which remains to be the leading cause of morbidity and death across the globe, therefore, smoking behaviour prediction is essential. This research explores the complex relationship between smoking and CVD, and clarifies how smoking negatively impacts CVH. The first step of the analysis is removing the Not-a-Number (NaN) values in the smoking behaviour bio signals data because all NaN values are in the target column ‘smoking’, which could lead to generate biased predictions. Further, robust scaling is applied to standardize the data, where all features are on the same scale. The Random Forest feature selection technique is applied to select the most relevant features for smoking status prediction. The Borderline-Synthetic Minority Oversampling Technique is used to balance the minority class with the majority class, to reduce the risk of overfitting in results. Finally, the cleaned and preprocessed data is passed to the Proposed Blending Model (PBM). In PBM, the Echo State Network and GoogleNet are placed on the base layer, and AlexNet is placed on the meta layer to make a robust smoking status prediction model. In terms of accuracy, precision, recall, and f1-score, the PBM outperforms the baseline techniques with an improvement score of 5%, 6%, 5%, and 7%, respectively. Additionally, the PBM’s results are validated through a 10-fold cross-validation method, which can increase its reliability and durability. Finally, the SHapley Additive exPlanations technique is used to understand the predictions made by the PBM. It can reveal the explainability and interpretability of the PBM. Conclusively, during the thorough simulation process, the results are collected and the ablation study is performed. The simulation results show that the early prediction of hidden smokers can help to reduce the effect of the CVDs. It also makes proactive healthcare strategies possible for better future outcomes and public health.
- Published
- 2024
- Full Text
- View/download PDF