Back to Search Start Over

Comparison of machine learning and logistic regression as predictive models for adverse maternal and neonatal outcomes of preeclampsia: A retrospective study

Authors :
Dongying Zheng
Xinyu Hao
Muhanmmad Khan
Lixia Wang
Fan Li
Ning Xiang
Fuli Kang
Timo Hamalainen
Fengyu Cong
Kedong Song
Chong Qiao
Source :
Frontiers in Cardiovascular Medicine, Vol 9 (2022)
Publication Year :
2022
Publisher :
Frontiers Media S.A., 2022.

Abstract

IntroductionPreeclampsia, one of the leading causes of maternal and fetal morbidity and mortality, demands accurate predictive models for the lack of effective treatment. Predictive models based on machine learning algorithms demonstrate promising potential, while there is a controversial discussion about whether machine learning methods should be recommended preferably, compared to traditional statistical models.MethodsWe employed both logistic regression and six machine learning methods as binary predictive models for a dataset containing 733 women diagnosed with preeclampsia. Participants were grouped by four different pregnancy outcomes. After the imputation of missing values, statistical description and comparison were conducted preliminarily to explore the characteristics of documented 73 variables. Sequentially, correlation analysis and feature selection were performed as preprocessing steps to filter contributing variables for developing models. The models were evaluated by multiple criteria.ResultsWe first figured out that the influential variables screened by preprocessing steps did not overlap with those determined by statistical differences. Secondly, the most accurate imputation method is K-Nearest Neighbor, and the imputation process did not affect the performance of the developed models much. Finally, the performance of models was investigated. The random forest classifier, multi-layer perceptron, and support vector machine demonstrated better discriminative power for prediction evaluated by the area under the receiver operating characteristic curve, while the decision tree classifier, random forest, and logistic regression yielded better calibration ability verified, as by the calibration curve.ConclusionMachine learning algorithms can accomplish prediction modeling and demonstrate superior discrimination, while Logistic Regression can be calibrated well. Statistical analysis and machine learning are two scientific domains sharing similar themes. The predictive abilities of such developed models vary according to the characteristics of datasets, which still need larger sample sizes and more influential predictors to accumulate evidence.

Details

Language :
English
ISSN :
2297055X
Volume :
9
Database :
Directory of Open Access Journals
Journal :
Frontiers in Cardiovascular Medicine
Publication Type :
Academic Journal
Accession number :
edsdoj.3ae3e60ffbc432d839ac7629624e30c
Document Type :
article
Full Text :
https://doi.org/10.3389/fcvm.2022.959649