Start Over

Biomedical word sense disambiguation with bidirectional long short-term memory and attention-based neural networks

Authors :: Xiuwen Liu
Zhe He
Daniel Biś
Canlin Zhang
Source :: BMC Bioinformatics, Vol 20, Iss S16, Pp 1-15 (2019), BMC Bioinformatics
Publication Year :: 2019
Publisher :: BMC, 2019.
Abstract: Background In recent years, deep learning methods have been applied to many natural language processing tasks to achieve state-of-the-art performance. However, in the biomedical domain, they have not out-performed supervised word sense disambiguation (WSD) methods based on support vector machines or random forests, possibly due to inherent similarities of medical word senses. Results In this paper, we propose two deep-learning-based models for supervised WSD: a model based on bi-directional long short-term memory (BiLSTM) network, and an attention model based on self-attention architecture. Our result shows that the BiLSTM neural network model with a suitable upper layer structure performs even better than the existing state-of-the-art models on the MSH WSD dataset, while our attention model was 3 or 4 times faster than our BiLSTM model with good accuracy. In addition, we trained “universal” models in order to disambiguate all ambiguous words together. That is, we concatenate the embedding of the target ambiguous word to the max-pooled vector in the universal models, acting as a “hint”. The result shows that our universal BiLSTM neural network model yielded about 90 percent accuracy. Conclusion Deep contextual models based on sequential information processing methods are able to capture the relative contextual information from pre-trained input word embeddings, in order to provide state-of-the-art results for supervised biomedical WSD tasks.

Subjects :: Support Vector Machine
020205 medical informatics
Biomedical
Computer science
Word sense disambiguation
02 engineering and technology
Self-attention
computer.software_genre
lcsh:Computer applications to medicine. Medical informatics
Vocabulary
Biochemistry
03 medical and health sciences
Structural Biology
0202 electrical engineering, electronic engineering, information engineering
Humans
Molecular Biology
lcsh:QH301-705.5
Natural Language Processing
030304 developmental biology
Structure (mathematical logic)
0303 health sciences
Artificial neural network
business.industry
Research
Applied Mathematics
Deep learning
Information processing
Computer Science Applications
Random forest
Support vector machine
lcsh:Biology (General)
Embedding
lcsh:R858-859.7
Neural Networks, Computer
Artificial intelligence
business
LSTM
computer
Algorithms
Word (computer architecture)
Natural language processing

Details

Language :: English
ISSN :: 14712105
Volume :: 20
Database :: OpenAIRE
Journal :: BMC Bioinformatics
Accession number :: edsair.doi.dedup.....1fe1f09ab9856f41410d8a5952b81b18

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Biomedical word sense disambiguation with bidirectional long short-term memory and attention-based neural networks

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Biomedical word sense disambiguation with bidirectional long short-term memory and attention-based neural networks

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources