Start Over

Automated detection of substance use information from electronic health records for a pediatric population

Authors :: Alycia Bachtel
Yizhao Ni
Sarah J. Beal
Katie Nause
Source :: Journal of the American Medical Informatics Association : JAMIA
Publication Year :: 2021
Publisher :: Oxford University Press (OUP), 2021.
Abstract: Objective Substance use screening in adolescence is unstandardized and often documented in clinical notes, rather than in structured electronic health records (EHRs). The objective of this study was to integrate logic rules with state-of-the-art natural language processing (NLP) and machine learning technologies to detect substance use information from both structured and unstructured EHR data. Materials and Methods Pediatric patients (10-20 years of age) with any encounter between July 1, 2012, and October 31, 2017, were included (n = 3890 patients; 19 478 encounters). EHR data were extracted at each encounter, manually reviewed for substance use (alcohol, tobacco, marijuana, opiate, any use), and coded as lifetime use, current use, or family use. Logic rules mapped structured EHR indicators to screening results. A knowledge-based NLP system and a deep learning model detected substance use information from unstructured clinical narratives. System performance was evaluated using positive predictive value, sensitivity, negative predictive value, specificity, and area under the receiver-operating characteristic curve (AUC). Results The dataset included 17 235 structured indicators and 27 141 clinical narratives. Manual review of clinical narratives captured 94.0% of positive screening results, while structured EHR data captured 22.0%. Logic rules detected screening results from structured data with 1.0 and 0.99 for sensitivity and specificity, respectively. The knowledge-based system detected substance use information from clinical narratives with 0.86, 0.79, and 0.88 for AUC, sensitivity, and specificity, respectively. The deep learning model further improved detection capacity, achieving 0.88, 0.81, and 0.85 for AUC, sensitivity, and specificity, respectively. Finally, integrating predictions from structured and unstructured data achieved high detection capacity across all cases (0.96, 0.85, and 0.87 for AUC, sensitivity, and specificity, respectively). Conclusions It is feasible to detect substance use screening and results among pediatric patients using logic rules, NLP, and machine learning technologies.

Details

ISSN :: 1527974X
Volume :: 28
Database :: OpenAIRE
Journal :: Journal of the American Medical Informatics Association
Accession number :: edsair.doi.dedup.....069f5a78d2cc5ddb390a2bdada325873
Full Text :: https://doi.org/10.1093/jamia/ocab116

Full Text Access

View/download PDF

Tools

Bookmark
Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Automated detection of substance use information from electronic health records for a pediatric population

Automated detection of substance use information from electronic health records for a pediatric population

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Automated detection of substance use information from electronic health records for a pediatric population

Automated detection of substance use information from electronic health records for a pediatric population

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources