Back to Search Start Over

Predicting adverse outcomes due to diabetes complications with machine learning using administrative health data

Authors :
Mathieu Ravaut
Hamed Sadeghi
Kin Kwan Leung
Maksims Volkovs
Kathy Kornas
Vinyas Harish
Tristan Watson
Gary F. Lewis
Alanna Weisman
Tomi Poutanen
Laura Rosella
Source :
npj Digital Medicine, Vol 4, Iss 1, Pp 1-12 (2021)
Publication Year :
2021
Publisher :
Nature Portfolio, 2021.

Abstract

Abstract Across jurisdictions, government and health insurance providers hold a large amount of data from patient interactions with the healthcare system. We aimed to develop a machine learning-based model for predicting adverse outcomes due to diabetes complications using administrative health data from the single-payer health system in Ontario, Canada. A Gradient Boosting Decision Tree model was trained on data from 1,029,366 patients, validated on 272,864 patients, and tested on 265,406 patients. Discrimination was assessed using the AUC statistic and calibration was assessed visually using calibration plots overall and across population subgroups. Our model predicting three-year risk of adverse outcomes due to diabetes complications (hyper/hypoglycemia, tissue infection, retinopathy, cardiovascular events, amputation) included 700 features from multiple diverse data sources and had strong discrimination (average test AUC = 77.7, range 77.7–77.9). Through the design and validation of a high-performance model to predict diabetes complications adverse outcomes at the population level, we demonstrate the potential of machine learning and administrative health data to inform health planning and healthcare resource allocation for diabetes management.

Details

Language :
English
ISSN :
23986352
Volume :
4
Issue :
1
Database :
Directory of Open Access Journals
Journal :
npj Digital Medicine
Publication Type :
Academic Journal
Accession number :
edsdoj.618f79c514f043dc8f26ffb41f8e1df6
Document Type :
article
Full Text :
https://doi.org/10.1038/s41746-021-00394-8