Back to Search Start Over

EGFI: drug–drug interaction extraction and generation with fusion of enriched entity and sentence information.

Authors :
Huang, Lei
Lin, Jiecong
Li, Xiangtao
Song, Linqi
Zheng, Zetian
Wong, Ka-Chun
Source :
Briefings in Bioinformatics; Jan2022, Vol. 23 Issue 1, p1-14, 14p
Publication Year :
2022

Abstract

Motivation The rapid growth in literature accumulates diverse and yet comprehensive biomedical knowledge hidden to be mined such as drug interactions. However, it is difficult to extract the heterogeneous knowledge to retrieve or even discover the latest and novel knowledge in an efficient manner. To address such a problem, we propose EGFI for extracting and consolidating drug interactions from large-scale medical literature text data. Specifically, EGFI consists of two parts: classification and generation. In the classification part, EGFI encompasses the language model BioBERT which has been comprehensively pretrained on biomedical corpus. In particular, we propose the multihead self-attention mechanism and packed BiGRU to fuse multiple semantic information for rigorous context modeling. In the generation part, EGFI utilizes another pretrained language model BioGPT-2 where the generation sentences are selected based on filtering rules. Results We evaluated the classification part on 'DDIs 2013' dataset and 'DTIs' dataset, achieving the F1 scores of 0.842 and 0.720 respectively. Moreover, we applied the classification part to distinguish high-quality generated sentences and verified with the existing growth truth to confirm the filtered sentences. The generated sentences that are not recorded in DrugBank and DDIs 2013 dataset demonstrated the potential of EGFI to identify novel drug relationships. Availability Source code are publicly available at https://github.com/Layne-Huang/EGFI. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
14675463
Volume :
23
Issue :
1
Database :
Complementary Index
Journal :
Briefings in Bioinformatics
Publication Type :
Academic Journal
Accession number :
155892334
Full Text :
https://doi.org/10.1093/bib/bbab451