Back to Search Start Over

Annotation and extraction of age and temporally-related events from clinical histories.

Authors :
Hong J
Davoudi A
Yu S
Mowery DL
Source :
BMC medical informatics and decision making [BMC Med Inform Decis Mak] 2020 Dec 30; Vol. 20 (Suppl 11), pp. 338. Date of Electronic Publication: 2020 Dec 30.
Publication Year :
2020

Abstract

Background: Age and time information stored within the histories of clinical notes can provide valuable insights for assessing a patient's disease risk, understanding disease progression, and studying therapeutic outcomes. However, details of age and temporally-specified clinical events are not well captured, consistently codified, and readily available to research databases for study.<br />Methods: We expanded upon existing annotation schemes to capture additional age and temporal information, conducted an annotation study to validate our expanded schema, and developed a prototypical, rule-based Named Entity Recognizer to extract our novel clinical named entities (NE). The annotation study was conducted on 138 discharge summaries from the pre-annotated 2014 ShARe/CLEF eHealth Challenge corpus. In addition to existing NE classes (TIMEX3, SUBJECT_CLASS, DISEASE_DISORDER), our schema proposes 3 additional NEs (AGE, PROCEDURE, OTHER_EVENTS). We also propose new attributes, e.g., "degree_relation" which captures the degree of biological relation for subjects annotated under SUBJECT_CLASS. As a proof of concept, we applied the schema to 49 H&P notes to encode pertinent history information for a lung cancer cohort study.<br />Results: An abundance of information was captured under the new OTHER_EVENTS, PROCEDURE and AGE classes, with 23%, 10% and 8% of all annotated NEs belonging to the above classes, respectively. We observed high inter-annotator agreement of >80% for AGE and TIMEX3; the automated NLP system achieved F1 scores of 86% (AGE) and 86% (TIMEX3). Age and temporally-specified mentions within past medical, family, surgical, and social histories were common in our lung cancer data set; annotation is ongoing to support this translational research study.<br />Conclusions: Our annotation schema and NLP system can encode historical events from clinical notes to support clinical and translational research studies.

Details

Language :
English
ISSN :
1472-6947
Volume :
20
Issue :
Suppl 11
Database :
MEDLINE
Journal :
BMC medical informatics and decision making
Publication Type :
Academic Journal
Accession number :
33380319
Full Text :
https://doi.org/10.1186/s12911-020-01333-5