Back to Search Start Over

Large-scale identification of aortic stenosis and its severity using natural language processing on electronic health records.

Authors :
Solomon MD
Tabada G
Allen A
Sung SH
Go AS
Source :
Cardiovascular digital health journal [Cardiovasc Digit Health J] 2021 Mar 18; Vol. 2 (3), pp. 156-163. Date of Electronic Publication: 2021 Mar 18 (Print Publication: 2021).
Publication Year :
2021

Abstract

Background: Systematic case identification is critical to improving population health, but widely used diagnosis code-based approaches for conditions like valvular heart disease are inaccurate and lack specificity.<br />Objective: To develop and validate natural language processing (NLP) algorithms to identify aortic stenosis (AS) cases and associated parameters from semi-structured echocardiogram reports and compare their accuracy to administrative diagnosis codes.<br />Methods: Using 1003 physician-adjudicated echocardiogram reports from Kaiser Permanente Northern California, a large, integrated healthcare system (>4.5 million members), NLP algorithms were developed and validated to achieve positive and negative predictive values > 95% for identifying AS and associated echocardiographic parameters. Final NLP algorithms were applied to all adult echocardiography reports performed between 2008 and 2018 and compared to ICD-9/10 diagnosis code-based definitions for AS found from 14 days before to 6 months after the procedure date.<br />Results: A total of 927,884 eligible echocardiograms were identified during the study period among 519,967 patients. Application of the final NLP algorithm classified 104,090 (11.2%) echocardiograms with any AS (mean age 75.2 years, 52% women), with only 67,297 (64.6%) having a diagnosis code for AS between 14 days before and up to 6 months after the associated echocardiogram. Among those without associated diagnosis codes, 19% of patients had hemodynamically significant AS (ie, greater than mild disease).<br />Conclusion: A validated NLP algorithm applied to a systemwide echocardiography database was substantially more accurate than diagnosis codes for identifying AS. Leveraging machine learning-based approaches on unstructured electronic health record data can facilitate more effective individual and population management than using administrative data alone.<br /> (© 2021 Heart Rhythm Society.)

Details

Language :
English
ISSN :
2666-6936
Volume :
2
Issue :
3
Database :
MEDLINE
Journal :
Cardiovascular digital health journal
Publication Type :
Academic Journal
Accession number :
35265904
Full Text :
https://doi.org/10.1016/j.cvdhj.2021.03.003