Entity recognition of Chinese medical text based on multi-head self-attention combined with BILSTM-CRF

Authors :
Chaofan Li
Kai Ma
Source :
Mathematical Biosciences and Engineering, Vol 19, Iss 3, Pp 2206-2218 (2022)
Publication Year :
2022
Publisher :
AIMS Press, 2022.

Abstract

Named entities are the main carriers of relevant medical knowledge in Electronic Medical Records (EMR). Because of the specific structure of the Chinese language, clinical electronic medical records suffer from problems such as word segmentation ambiguity and polysemy. To address this, a Clinical Named Entity Recognition (CNER) model based on multi-head self-attention combined with a BILSTM neural network and Conditional Random Fields (CRF) is proposed. First, a pre-trained language model combines character vectors and word vectors for the text sequences of the original dataset. The sequences are then fed in parallel into the multi-head self-attention module and the BILSTM neural network module. The module outputs are spliced to obtain multi-level information such as contextual information and feature association weights. Finally, entity annotation is performed by the CRF. The results of multiple comparison experiments show that the structure of the proposed model is reasonable and robust, and that it can effectively improve Chinese CNER. The model extracts multi-level, more comprehensive text features and compensates for the loss of long-distance dependencies, giving better applicability and recognition performance.
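
The final CRF step described in the abstract assigns one tag per character by choosing the tag sequence with the highest combined score. A minimal sketch of that decoding step (Viterbi search) follows. It is not taken from the paper: the function name viterbi_decode, the B/I/O tag set, and all scores are hypothetical toy values chosen for illustration, and the emission scores stand in for whatever the spliced self-attention and BILSTM features would produce.

```python
from typing import Dict, List, Tuple


def viterbi_decode(
    emissions: List[Dict[str, float]],
    transitions: Dict[str, Dict[str, float]],
    tags: List[str],
) -> Tuple[List[str], float]:
    """Return the highest-scoring tag sequence and its total score.

    emissions[t][tag]       -- score of `tag` at position t (assumed to come
                               from the upstream network)
    transitions[prev][cur]  -- score of moving from tag `prev` to tag `cur`
    """
    # Best score of any path that ends in each tag at position 0.
    score = {tag: emissions[0][tag] for tag in tags}
    backpointers: List[Dict[str, str]] = []

    for t in range(1, len(emissions)):
        new_score: Dict[str, float] = {}
        pointers: Dict[str, str] = {}
        for cur in tags:
            # Pick the previous tag that maximises the path score into `cur`.
            best_prev = max(tags, key=lambda p: score[p] + transitions[p][cur])
            new_score[cur] = (
                score[best_prev] + transitions[best_prev][cur] + emissions[t][cur]
            )
            pointers[cur] = best_prev
        score = new_score
        backpointers.append(pointers)

    # Trace the best path backwards from the best final tag.
    last = max(tags, key=lambda tag: score[tag])
    best_score = score[last]
    path = [last]
    for pointers in reversed(backpointers):
        path.append(pointers[path[-1]])
    path.reverse()
    return path, best_score


# Toy example (hypothetical values): two characters, B/I/O tagging scheme.
tags = ["O", "B", "I"]
# All transitions are neutral except O -> I, which is heavily penalised
# because an entity cannot continue without having begun.
transitions = {prev: {cur: 0.0 for cur in tags} for prev in tags}
transitions["O"]["I"] = -10.0

emissions = [
    {"O": 1.0, "B": 0.8, "I": 0.0},  # per-position argmax would pick "O"
    {"O": 0.0, "B": 0.0, "I": 1.0},  # per-position argmax would pick "I"
]

path, best_score = viterbi_decode(emissions, transitions, tags)
print(path)
```

In this toy case, picking the best tag at each position independently would give the invalid sequence O, I; scoring whole sequences with the transition penalty yields B, I instead, which is the kind of label consistency a CRF layer is typically used to enforce.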

Details

Language :
English
ISSN :
1551-0018
Volume :
19
Issue :
3
Database :
Directory of Open Access Journals
Journal :
Mathematical Biosciences and Engineering
Publication Type :
Academic Journal
Accession number :
edsdoj.6c58a86453b34b19addf8721436c6274
Document Type :
article
Full Text :
https://doi.org/10.3934/mbe.2022103?viewType=HTML