1. Supervised machine learning for the early prediction of acute respiratory distress syndrome (ARDS)
- Author
-
Emily Pellegrini, Jana Hoffman, Charlotte Summers, Abigail Green-Saxena, Sidney Le, Ritankar Das, Jacob Calvert, Summers, Charlotte [0000-0002-7269-2873], and Apollo - University of Cambridge Repository
- Subjects
Adult ,Male ,ARDS ,Adolescent ,Critical Care ,Databases, Factual ,Acute respiratory distress ,Critical Care and Intensive Care Medicine ,Machine learning ,computer.software_genre ,Clinical decision support system ,Cross-validation ,law.invention ,Young Adult ,03 medical and health sciences ,0302 clinical medicine ,Risk Factors ,law ,Intensive care ,Early prediction ,Electronic health records ,Humans ,Medicine ,Intensive care unit ,Aged ,Retrospective Studies ,Respiratory Distress Syndrome ,Acute respiratory distress syndrome ,Receiver operating characteristic ,business.industry ,Medical record ,Clinical decision support systems ,030208 emergency & critical care medicine ,Middle Aged ,medicine.disease ,Intensive Care Units ,Early Diagnosis ,ROC Curve ,030228 respiratory system ,Medical informatics ,Area Under Curve ,Test set ,Female ,Supervised Machine Learning ,Artificial intelligence ,business ,computer - Abstract
PurposeAcute respiratory distress syndrome (ARDS) is a serious respiratory condition with high mortality and associated morbidity. The objective of this study is to develop and evaluate a novel application of gradient boosted tree models trained on patient health record data for the early prediction of ARDS.Materials and Methods9919 patient encounters were retrospectively analyzed from the Medical Information Mart for Intensive Care III (MIMIC-III) data base. XGBoost gradient boosted tree models for early ARDS prediction were created using routinely collected clinical variables and numerical representations of radiology reports as inputs. XGBoost models were iteratively trained and validated using 10-fold cross validation.ResultsOn a hold-out test set, algorithm classifiers attained area under the receiver operating characteristic curve (AUROC) values of 0.905, 0.827, 0.810, and 0.790 when tested for the prediction of ARDS at 0-, 12-, 24-, and 48-hour windows prior to onset, respectively.ConclusionSupervised machine learning predictions may help predict patients with ARDS up to 48 hours prior to onset.
- Published
- 2020
- Full Text
- View/download PDF