De-black-boxing health AI: demonstrating reproducible machine learning computable phenotypes using the N3C-RECOVER Long COVID model in the All of Us data repository.

Authors :
Pfaff ER
Girvin AT
Crosskey M
Gangireddy S
Master H
Wei WQ
Kerchberger VE
Weiner M
Harris PA
Basford M
Lunt C
Chute CG
Moffitt RA
Haendel M
Source :
Journal of the American Medical Informatics Association : JAMIA [J Am Med Inform Assoc] 2023 Jun 20; Vol. 30 (7), pp. 1305-1312.
Publication Year :
2023

Abstract

Machine learning (ML)-driven computable phenotypes are among the most challenging to share and reproduce. Despite this difficulty, the urgent public health considerations around Long COVID make it especially important to ensure the rigor and reproducibility of Long COVID phenotyping algorithms such that they can be made available to a broad audience of researchers. As part of the NIH Researching COVID to Enhance Recovery (RECOVER) Initiative, researchers with the National COVID Cohort Collaborative (N3C) devised and trained an ML-based phenotype to identify patients highly probable to have Long COVID. Supported by RECOVER, N3C and NIH's All of Us study partnered to reproduce the output of N3C's trained model in the All of Us data enclave, demonstrating model extensibility in multiple environments. This case study in ML-based phenotype reuse illustrates how open-source software best practices and cross-site collaboration can de-black-box phenotyping algorithms, prevent unnecessary rework, and promote open science in informatics.
(© The Author(s) 2023. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For permissions, please email: journals.permissions@oup.com.)

Details

Language :
English
ISSN :
1527-974X
Volume :
30
Issue :
7
Database :
MEDLINE
Journal :
Journal of the American Medical Informatics Association : JAMIA
Publication Type :
Academic Journal
Accession number :
37218289
Full Text :
https://doi.org/10.1093/jamia/ocad077