Back to Search Start Over

Mixed-methods evaluation of three natural language processing modeling approaches for measuring documented goals-of-care discussions in the electronic health record

Authors :
Alison M. Uyeda
J. Randall Curtis
Ruth A. Engelberg
Lyndia C. Brumback
Yue Guo
James Sibley
William B. Lober
Trevor Cohen
Janaki Torrence
Joanna Heywood
Sudiptho R. Paul
Erin K. Kross
Robert Y. Lee
Source :
J Pain Symptom Manage
Publication Year :
2022
Publisher :
Elsevier BV, 2022.

Abstract

CONTEXT: Documented goals-of-care discussions are an important quality metric for patients with serious illness. Natural language processing (NLP) is a promising approach for identifying goals-of-care discussions in the electronic health record (EHR). OBJECTIVES: To compare three NLP modeling approaches for identifying EHR documentation of goals-of-care discussions and generate hypotheses about differences in performance. METHODS: We conducted a mixed-methods study to evaluate performance and misclassification for three NLP featurization approaches modeled with regularized logistic regression: bag-of-words (BOW), rule-based, and a hybrid approach. From a prospective cohort of 150 patients hospitalized with serious illness over 2018–2020, we collected 4,391 inpatient EHR notes; 99 (2.3%) contained documented goals-of-care discussions. We used leave-one-out cross-validation to estimate performance by comparing pooled NLP predictions to human abstractors with receiver-operating-characteristic (ROC) and precision-recall (PR) analyses. We qualitatively examined a purposive sample of 70 NLP-misclassified notes using content analysis to identify linguistic features that allowed us to generate hypotheses underpinning misclassification. RESULTS: All three modeling approaches discriminated between notes with and without goals-of-care discussions (AUC(ROC): BOW, 0.907; rule-based, 0.948; hybrid, 0.965). Precision and recall were only moderate (precision at 70% recall: BOW, 16.2%; rule-based, 50.4%; hybrid, 49.3%; AUC(PR): BOW, 0.505; rule-based, 0.579; hybrid, 0.599). Qualitative analysis revealed patterns underlying performance differences between BOW and rule-based approaches. CONCLUSION: NLP holds promise for identifying EHR-documented goals-of-care discussions. However, the rarity of goals-of-care content in EHR data limits performance. Our findings highlight opportunities to optimize NLP modeling approaches, and support further exploration of different NLP approaches to identify goals-of-care discussions.

Details

ISSN :
08853924
Volume :
63
Database :
OpenAIRE
Journal :
Journal of Pain and Symptom Management
Accession number :
edsair.doi.dedup.....c0c3e7641f3b5c6346888198fa97bc0c
Full Text :
https://doi.org/10.1016/j.jpainsymman.2022.02.006