1. Electronic health record signatures identify undiagnosed patients with common variable immunodeficiency disease.
- Author
-
Johnson, Ruth, Stephens, Alexis V., Mester, Rachel, Knyazev, Sergey, Kohn, Lisa A., Freund, Malika K., Bondhus, Leroy, Hill, Brian L., Schwarz, Tommer, Zaitlen, Noah, Arboleda, Valerie A., A. Bastarache, Lisa, Pasaniuc, Bogdan, and Butte, Manish J.
- Subjects
COMMON variable immunodeficiency ,ELECTRONIC health records ,MACHINE learning ,AUTOIMMUNE diseases ,DELAYED diagnosis ,ARTIFICIAL intelligence - Abstract
Human inborn errors of immunity include rare disorders entailing functional and quantitative antibody deficiencies due to impaired B cells called the common variable immunodeficiency (CVID) phenotype. Patients with CVID face delayed diagnoses and treatments for 5 to 15 years after symptom onset because the disorders are rare (prevalence of ~1/25,000), and there is extensive heterogeneity in CVID phenotypes, ranging from infections to autoimmunity to inflammatory conditions, overlapping with other more common disorders. The prolonged diagnostic odyssey drives excessive system-wide costs before diagnosis. Because there is no single causal mechanism, there are no genetic tests to definitively diagnose CVID. Here, we present PheNet, a machine learning algorithm that identifies patients with CVID from their electronic health records (EHRs). PheNet learns phenotypic patterns from verified CVID cases and uses this knowledge to rank patients by likelihood of having CVID. PheNet could have diagnosed more than half of our patients with CVID 1 or more years earlier than they had been diagnosed. When applied to a large EHR dataset, followed by blinded chart review of the top 100 patients ranked by PheNet, we found that 74% were highly probable to have CVID. We externally validated PheNet using >6 million records from disparate medical systems in California and Tennessee. As artificial intelligence and machine learning make their way into health care, we show that algorithms such as PheNet can offer clinical benefits by expediting the diagnosis of rare diseases. Editor's summary: Common variable immunodeficiency (CVID) disease is an inborn error of immunity characterized by antibody deficiency and impaired B cell responses. This condition can be difficult to diagnose on account of its heterogenous presentation. Johnson et al. developed and validated a machine learning model designed to parse patient electronic health record data and rank individuals according to their likelihood of having CVID. Their retrospective analysis suggested that the method could help diagnose many individuals earlier than standard clinical methods, potentially hastening their referral for specialist treatment. —Catherine Charneski [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF