1. Extracting fact-condition relation from geological papers via deep structured semantic model with multi-grained representation.
- Author
-
Chen, Qizhi, Yao, Hong, Zhou, Diange, Li, Shengwen, and Dong, Lijun
- Subjects
- *
KNOWLEDGE graphs , *DATA mining - Abstract
The fact-condition statement plays a significant role in the geological papers, where geological theory or phenomenon as a fact is constrained by its precondition. This constraint relation conveys a lot of key information in geological papers, but it has been ignored in most of studies on the information extraction. Although the rule-based algorithms have been proposed in the previous studies to extract the relationship between facts and conditions, but their low accuracy limit usability. Therefore, this paper introduces a series of mechanisms and methods to improve the extraction accuracy. We firstly utilize three kinds of pooling for extracting the semantic features of fact-condition tuples to find that balanced semantic representations can more accurately reflect their relations. In addition, semantic correlation between facts and conditions is also designed to mine their relations via the measurement of semantic similarity or semantic distance. Based on the above, the Deep Structured Semantic Model with Multi-Grained Representation (MGR-DSSM) is proposed. In this model, the Deep Structured Semantic Model (DSSM) is utilized two individual projections for denoising semantic correlation between facts and conditions, while the Multi-Grained Representation (MGR) is designed for encoding their semantic correlation at multiple granularities. Furthermore, the linear MGR-DSSM with SciBERT achieves the best result with 70.41% of accuracy and 74.55% of F1 score, obviously outperforming other baseline models. Finally, the linear MGR-DSSM with SciBERT is adopted to extract super relation from 722 geological paper abstracts, and the geological knowledge graph with fact-condition statement is built. With more accurate relation between fact and condition, the information validity of geological knowledge graph is enhanced. • The relation between facts and conditions play an essential role in the geological paper, the extraction of which is studied for the first time. • A series of mechanisms are proposed to improve the accuracy of fact-condition relation extraction. • The MGR-DSSM is proposed, which obviously outperforms other baselines in super relation extraction. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF