Back to Search
Start Over
From electronic health records to terminology base: A novel knowledge base enrichment approach
- Source :
- Journal of Biomedical Informatics. 113:103628
- Publication Year :
- 2021
- Publisher :
- Elsevier BV, 2021.
-
Abstract
- Enriching terminology base (TB) is an important and continuous process, since formal term can be renamed and new term alias emerges all the time. As a potential supplementary for TB enrichment, electronic health record (EHR) is a fundamental source for clinical research and practise. The task to align the set of external terms in EHRs to TB can be regarded as entity alignment without structure information. Conventional approaches mainly use internal structural information of multiple knowledge bases (KBs) to map entities and their counterparts among KBs. However, the external terms in EHRs are independent clinical terms, which lack of interrelations. To achieve entity alignment in this case, we proposed a novel automatic TB enrichment approach, named semantic & structure embeddings-based relevancy prediction (S2ERP). To obtain the semantic embedding of external terms, we fed them with formal entity into a pre-trained language model. Meanwhile, a graph convolutional network was used to obtain the structure embeddings of the synonyms and hyponyms in TB. Afterwards, S2ERP combines both embeddings to measure the relevancy. Experimental results on clinical indicator TB, collected from 38 top-class hospitals of Shanghai Hospital Development Center, showed that the proposed approach outperforms baseline methods by 14.16% in Hits@1.
- Subjects :
- China
0303 health sciences
Information retrieval
Alias
business.industry
Computer science
Knowledge Bases
Health Informatics
Health records
Semantics
Computer Science Applications
Terminology
03 medical and health sciences
0302 clinical medicine
Knowledge base
Electronic health record
Electronic Health Records
Embedding
Graph (abstract data type)
030212 general & internal medicine
Language model
business
Natural Language Processing
030304 developmental biology
Subjects
Details
- ISSN :
- 15320464
- Volume :
- 113
- Database :
- OpenAIRE
- Journal :
- Journal of Biomedical Informatics
- Accession number :
- edsair.doi.dedup.....140ab6f7d50bc1a52fc5cbbe5d626990
- Full Text :
- https://doi.org/10.1016/j.jbi.2020.103628