Back to Search Start Over

Development and validation of parsimonious algorithms to classify acute respiratory distress syndrome phenotypes: a secondary analysis of randomised controlled trials

Authors :
Daniel F. McAuley
Pratik Sinha
Carolyn S. Calfee
Michael A. Matthay
Kevin L. Delucchi
Cecilia M O'Kane
Source :
The Lancet. Respiratory medicine, vol 8, iss 3, Lancet Respir Med, Sinha, P, Delucchi, K L, McAuley, D, O'Kane, C, Matthay, M & Calfee, C S 2020, ' Development and validation of parsimonious algorithms to classify acute respiratory distress syndrome phenotypes: a secondary analysis of randomised controlled trials ', The Lancet Respiratory Medicine .
Publication Year :
2020
Publisher :
eScholarship, University of California, 2020.

Abstract

BackgroundUsing latent class analysis (LCA) in five randomised controlled trial (RCT) cohorts, two distinct phenotypes of acute respiratory distress syndrome (ARDS) have been identified: hypoinflammatory and hyperinflammatory. The phenotypes are associated with differential outcomes and treatment response. The objective of this study was to develop parsimonious models for phenotype identification that could be accurate and feasible to use in the clinical setting.MethodsIn this retrospective study, three RCT cohorts from the National Lung, Heart, and Blood Institute ARDS Network (ARMA, ALVEOLI, and FACTT) were used as the derivation dataset (n=2022), from which the machine learning and logistic regression classifer models were derived, and a fourth (SAILS; n=715) from the same network was used as the validation test set. LCA-derived phenotypes in all of these cohorts served as the reference standard. Machine-learning algorithms (random forest, bootstrapped aggregating, and least absolute shrinkage and selection operator) were used to select a maximum of six important classifier variables, which were then used to develop nested logistic regression models. Only cases with complete biomarker data in the derivation dataset were used for variable selection. The best logistic regression models based on parsimony and predictive accuracy were then evaluated in the validation test set. Finally, the models' prognostic validity was tested in two external ARDS clinical trial datasets (START and HARP-2) by assessing mortality at days 28, 60, and 90 and ventilator-free days to day 28.FindingsThe six most important classifier variables were interleukin (IL)-8, IL-6, protein C, soluble tumour necrosis factor receptor 1, bicarbonate, and vasopressor use. From the nested models, three-variable (IL-8, bicarbonate, and protein C) and four-variable (3-variable plus vasopressor use) models were adjudicated to be the best performing. In the validation test set, both models showed good accuracy (AUC 0·94 [95% CI 0·92-0·95] for the three-variable model and 0·95 [95% CI 0·93-0·96] for the four-variable model) against LCA classifications. As with LCA-derived phenotypes, the hyperinflammatory phenotype as identified by the classifier model was associated with higher mortality at day 90 (87 [39%] of 223 patients vs 112 [23%] of 492 patients; p

Details

Database :
OpenAIRE
Journal :
The Lancet. Respiratory medicine, vol 8, iss 3, Lancet Respir Med, Sinha, P, Delucchi, K L, McAuley, D, O'Kane, C, Matthay, M & Calfee, C S 2020, ' Development and validation of parsimonious algorithms to classify acute respiratory distress syndrome phenotypes: a secondary analysis of randomised controlled trials ', The Lancet Respiratory Medicine .
Accession number :
edsair.doi.dedup.....50de9299d78785d0a6426842bbec6e62