Back to Search Start Over

ToC-RWG: Explore the Combination of Topic Model and Citation Information for Automatic Related Work Generation

Authors :
Shasha Li
Ting Wang
Pancheng Wang
Haifang Zhou
Jintao Tang
Source :
IEEE Access, Vol 8, Pp 13043-13055 (2020)
Publication Year :
2020
Publisher :
IEEE, 2020.

Abstract

Automatic related work generation is a new challenge in multi-document scientific summarization focusing on refining a related work section for a given scientific paper. In this paper, we propose a brand new framework ToC-RWG for related work generation by incorporating topic model and citation information. We present an unsupervised generative probabilistic model, called QueryTopicSum, which utilizes a LDA-style model to characterize the generative process of both the scientific paper and its reference papers. We also take advantage of citations of reference papers to identify Cited Text Spans (CTS) from reference papers. This approach provides us with a perspective of annotating the importance of the reference papers from the academic community. With QueryTopicSum and the identified CTS as candidate sentences, an optimization framework based on minimizing KL divergence is exerted to select the most representative sentences for related work generation. Our evaluation results on a set of 50 scientific papers along with their corresponding reference papers show that ToC-RWG achieves a considerable improvement over generic multi-document summarization and scientific summarization baselines.

Details

Language :
English
ISSN :
21693536
Volume :
8
Database :
OpenAIRE
Journal :
IEEE Access
Accession number :
edsair.doi.dedup.....f8c1787673189fe10bdd6b2694ce295e