Back to Search
Start Over
Entity recognition of Chinese medical text based on multi-head self-attention combined with BILSTM-CRF
- Source :
- Mathematical Biosciences and Engineering, Vol 19, Iss 3, Pp 2206-2218 (2022)
- Publication Year :
- 2022
- Publisher :
- AIMS Press, 2022.
-
Abstract
- Named entities are the main carriers of relevant medical knowledge in Electronic Medical Records (EMR). Clinical electronic medical records lead to problems such as word segmentation ambiguity and polysemy due to the specificity of Chinese language structure, so a Clinical Named Entity Recognition (CNER) model based on multi-head self-attention combined with BILSTM neural network and Conditional Random Fields is proposed. Firstly, the pre-trained language model organically combines char vectors and word vectors for the text sequences of the original dataset. The sequences are then fed into the parallel structure of the multi-head self-attention module and the BILSTM neural network module, respectively. By splicing the output of the neural network module to obtain multi-level information such as contextual information and feature association weights. Finally, entity annotation is performed by CRF. The results of the multiple comparison experiments show that the structure of the proposed model is very reasonable and robust, and it can effectively improve the Chinese CNER model. The model can extract multi-level and more comprehensive text features, compensate for the defect of long-distance dependency loss, with better applicability and recognition performance.
Details
- Language :
- English
- ISSN :
- 15510018
- Volume :
- 19
- Issue :
- 3
- Database :
- Directory of Open Access Journals
- Journal :
- Mathematical Biosciences and Engineering
- Publication Type :
- Academic Journal
- Accession number :
- edsdoj.6c58a86453b34b19addf8721436c6274
- Document Type :
- article
- Full Text :
- https://doi.org/10.3934/mbe.2022103?viewType=HTML