Back to Search Start Over

Knowledge triple mining via multi-task learning.

Authors :
Zhang, Zhao
Zhuang, Fuzhen
Li, Xuebing
Niu, Zheng-Yu
He, Jia
He, Qing
Xiong, Hui
Source :
Information Systems. Feb2019, Vol. 80, p64-75. 12p.
Publication Year :
2019

Abstract

Abstract Recent years have witnessed the rapid development of knowledge bases (KBs) such as WordNet, Yago and DBpedia, which are useful resources in AI-related applications. However, most of the existing KBs are suffering from incompleteness and manually adding knowledge into KBs is inefficient. Therefore, automatically mining knowledge becomes a critical issue. To this end, in this paper, we propose to develop a model (S 2 AMT) to extract knowledge triples, such as < Barack Obama, wife, Michelle Obama > , from the Internet and add them to KBs to support many downstream applications. Particularly, because the seed instances 1 1 In this paper, seed instances refer to labeled positive instances. for every relation is difficult to obtain, our model is capable of mining knowledge triples with limited available seed instances. To be more specific, we treat the knowledge triple mining task for each relation as a single task and use multi-task learning (MTL) algorithms to solve the problem, because MTL algorithms can often get better results than single-task learning (STL) ones with limited training data. Moreover, since finding proper task groups is a fatal problem in MTL which can directly influences the final results, we adopt a clustering algorithm to find proper task groups to further improve the performance. Finally, we conduct extensive experiments on real-world data sets and the experimental results clearly validate the performance of our MTL algorithms against STL ones. Highlights • We propose S 2 AMT to solve the problem of KTM with limited seed instances. • Our framework obtain better performance using MTL methods. • Our framework jointly use labeled and unlabeled instances during the training stage. • We give a fast method to find related tasks to further improve the performance. • Our work provides a new perspective for KTM when having limited seed instances. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
03064379
Volume :
80
Database :
Academic Search Index
Journal :
Information Systems
Publication Type :
Academic Journal
Accession number :
133439505
Full Text :
https://doi.org/10.1016/j.is.2018.09.003