251. Augmented Intelligence with Natural Language Processing Applied to Electronic Health Records is Useful for Identifying Patients with Non-Alcoholic Fatty Liver Disease at Risk for Disease Progression
- Author
-
Girish N. Nadkarni, Stephen B. Ellis, Lili Chan, Joseph L. Kannry, Peter A Bonis, Judy H. Cho, Ron Do, Tielman Van Vleck, Catherine K. Craven, Ruth J. F. Loos, and Steven G. Coca
- Subjects
Modalities ,business.industry ,Fatty liver ,Full text search ,Disease ,computer.software_genre ,medicine.disease ,3. Good health ,03 medical and health sciences ,Liver disease ,0302 clinical medicine ,Documentation ,Intelligence amplification ,medicine ,030211 gastroenterology & hepatology ,030212 general & internal medicine ,Artificial intelligence ,Diagnosis code ,business ,computer ,Natural language processing - Abstract
ObjectiveElectronic health record (EHR) systems contain structured data and unstructured documentation. Clinical insights can be derived from analyzing both but optimal methods for this have not been studied extensively. We compared various approaches to analyzing EHR data for non-alcoholic fatty liver disease (NAFLD).Materials and MethodsWe compared analysis of structured and unstructured EHR data using natural language processing (NLP), free-text search, and diagnostic codes against expert adjudication as the reference standard.ResultsOut of 38,575 patients, we identified 2,281 patients with NAFLD. From the remainder, 10,653 patients with similar data density were selected as a control group. NLP was more sensitive than ICD and text search (NLP 0.93 vs. ICD 0.28 vs. text search 0.81) with higher a F2 score (NLP 0.92 vs. ICD 0.34 vs. text search 0.81). 619 patients had suspected NAFLD documented in radiology notes not acknowledged in other forms of clinical documentation. Of these, 232 (37.5%) were found to have more advanced liver disease after a median of 1,057 days.DiscussionNLP-based approaches have superior accuracy in identifying NAFLD within the EHR compared to ICD/text search-based approaches. Suspected NAFLD on imaging is often not acknowledged in subsequent clinical documentation. Many such patients are later found to have more advanced liver disease.ConclusionFor identification of NAFLD, NLP performed better than alternative selection modalities and facilitated follow-on analysis of information flow. If accuracy can be proven to persist across clinical domains, NLP can identify patient phenotypes for biomedical research in an accurate and high-throughput manner.
- Published
- 2019
- Full Text
- View/download PDF