Back to Search Start Over

Detecting changes in the performance of a clinical machine learning tool over time.

Authors :
Schinkel M
Boerman AW
Paranjape K
Wiersinga WJ
Nanayakkara PWB
Source :
EBioMedicine [EBioMedicine] 2023 Nov; Vol. 97, pp. 104823. Date of Electronic Publication: 2023 Oct 02.
Publication Year :
2023

Abstract

Background: Excessive use of blood cultures (BCs) in Emergency Departments (EDs) results in low yields and high contamination rates, associated with increased antibiotic use and unnecessary diagnostics. Our team previously developed and validated a machine learning model to predict BC outcomes and enhance diagnostic stewardship. While the model showed promising initial results, concerns over performance drift due to evolving patient demographics, clinical practices, and outcome rates warrant continual monitoring and evaluation of such models.<br />Methods: A real-time evaluation of the model's performance was conducted between October 2021 and September 2022. The model was integrated into Amsterdam UMC's Electronic Health Record system, predicting BC outcomes for all adult patients with BC draws in real time. The model's performance was assessed monthly using metrics including the Area Under the Curve (AUC), Area Under the Precision-Recall Curve (AUPRC), and Brier scores. Statistical Process Control (SPC) charts were used to monitor variation over time.<br />Findings: Across 3.035 unique adult patient visits, the model achieved an average AUC of 0.78, AUPRC of 0.41, and a Brier score of 0.10 for predicting the outcome of BCs drawn in the ED. While specific population characteristics changed over time, no statistical points outside the statistical control range were detected in the AUC, AUPRC, and Brier scores, indicating stable model performance. The average BC positivity rate during the study period was 13.4%.<br />Interpretation: Despite significant changes in clinical practice, our BC stewardship tool exhibited stable performance, suggesting its robustness to changing environments. Using SPC charts for various metrics enables simple and effective monitoring of potential performance drift. The assessment of the variation of outcome rates and population changes may guide the specific interventions, such as intercept correction or recalibration, that may be needed to maintain a stable model performance over time. This study suggested no need to recalibrate or correct our BC stewardship tool.<br />Funding: No funding to disclose.<br />Competing Interests: Declaration of interests The authors declare no competing interests regarding this work.<br /> (Copyright © 2023 The Author(s). Published by Elsevier B.V. All rights reserved.)

Details

Language :
English
ISSN :
2352-3964
Volume :
97
Database :
MEDLINE
Journal :
EBioMedicine
Publication Type :
Academic Journal
Accession number :
37793210
Full Text :
https://doi.org/10.1016/j.ebiom.2023.104823