Back to Search Start Over

COVID-19 Vaccine: Predicting Vaccine Types and Assessing Mortality Risk Through Ensemble Learning Algorithms [version 2; peer review: 2 approved, 2 approved with reservations]

Authors :
Hind Monadhel
Ayad R. Abbas
Athraa Jasim Mohammed
Author Affiliations :
<relatesTo>1</relatesTo>Computer Science, University of Technology - Iraq, Baghdad, Iraq<br /><relatesTo>2</relatesTo>Computer Science, University of Technology- Iraq, Baghdad, Iraq<br /><relatesTo>3</relatesTo>Computer Science, University of Technology- Iraq, Baghdad, Iraq
Source :
F1000Research. 12:1200
Publication Year :
2024
Publisher :
London, UK: F1000 Research Limited, 2024.

Abstract

Background There is no doubt that vaccination is crucial for preventing the spread of diseases; however, not every vaccine is perfect or will work for everyone. The main objective of this work is to predict which vaccine will be most effective for a candidate without causing severe adverse reactions and to categorize a patient as potentially at high risk of death from the COVID-19 vaccine. Methods A comprehensive analysis was conducted using a dataset on COVID-19 vaccine adverse reactions, exploring binary and multiclass classification scenarios. Ensemble models, including Random Forest, Decision Tree, Light Gradient Boosting, and extreme gradient boosting algorithm, were utilized to achieve accurate predictions. Class balancing techniques like SMOTE, TOMEK_LINK, and SMOTETOMEK were incorporated to enhance model performance. Results The study revealed that pre-existing conditions such as diabetes, hypertension, heart disease, history of allergies, prior vaccinations, other medications, age, and gender were crucial factors associated with poor outcomes. Moreover, using medical history, the ensemble learning classifiers achieved accuracy scores ranging from 75% to 87% in predicting the vaccine type and mortality possibility. The Random Forest model emerged as the best prediction model, while the implementation of the SMOTE and SMOTETOMEK methods generally improved model performance. Conclusion The random forest model emerges as the top recommendation for machine learning tasks that require high accuracy and resilience. Moreover, the findings highlight the critical role of medical history in optimizing vaccine outcomes and minimizing adverse reactions.

Details

ISSN :
20461402
Volume :
12
Database :
F1000Research
Journal :
F1000Research
Notes :
Revised Amendments from Version 1 All the reviewers' comments have been addressed, and a new section was added to clarify the algorithm and sampling method selection, as well as parameter tuning. Additionally, a new table and figure were included., , [version 2; peer review: 2 approved, 2 approved with reservations]
Publication Type :
Academic Journal
Accession number :
edsfor.10.12688.f1000research.140395.2
Document Type :
research-article
Full Text :
https://doi.org/10.12688/f1000research.140395.2