Back to Search Start Over

DeepKG: an end-to-end deep learning-based workflow for biomedical knowledge graph extraction, optimization and applications.

Authors :
Li Z
Zhong Q
Yang J
Duan Y
Wang W
Wu C
He K
Source :
Bioinformatics (Oxford, England) [Bioinformatics] 2022 Feb 07; Vol. 38 (5), pp. 1477-1479.
Publication Year :
2022

Abstract

Summary: DeepKG is an end-to-end deep learning-based workflow that helps researchers automatically mine valuable knowledge in biomedical literature. Users can utilize it to establish customized knowledge graphs in specified domains, thus facilitating in-depth understanding on disease mechanisms and applications on drug repurposing and clinical research. To improve the performance of DeepKG, a cascaded hybrid information extraction framework is developed for training model of 3-tuple extraction, and a novel AutoML-based knowledge representation algorithm (AutoTransX) is proposed for knowledge representation and inference. The system has been deployed in dozens of hospitals and extensive experiments strongly evidence the effectiveness. In the context of 144 900 COVID-19 scholarly full-text literature, DeepKG generates a high-quality knowledge graph with 7980 entities and 43 760 3-tuples, a candidate drug list, and relevant animal experimental studies are being carried out. To accelerate more studies, we make DeepKG publicly available and provide an online tool including the data of 3-tuples, potential drug list, question answering system, visualization platform.<br />Availability and Implementation: All the results are publicly available at the website (http://covidkg.ai/).<br />Supplementary Information: Supplementary data are available at Bioinformatics online.<br /> (© The Author(s) 2021. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.)

Details

Language :
English
ISSN :
1367-4811
Volume :
38
Issue :
5
Database :
MEDLINE
Journal :
Bioinformatics (Oxford, England)
Publication Type :
Academic Journal
Accession number :
34788369
Full Text :
https://doi.org/10.1093/bioinformatics/btab767