Back to Search
Start Over
ToC-RWG: Explore the Combination of Topic Model and Citation Information for Automatic Related Work Generation
- Source :
- IEEE Access, Vol 8, Pp 13043-13055 (2020)
- Publication Year :
- 2020
- Publisher :
- IEEE, 2020.
-
Abstract
- Automatic related work generation is a new challenge in multi-document scientific summarization focusing on refining a related work section for a given scientific paper. In this paper, we propose a brand new framework ToC-RWG for related work generation by incorporating topic model and citation information. We present an unsupervised generative probabilistic model, called QueryTopicSum, which utilizes a LDA-style model to characterize the generative process of both the scientific paper and its reference papers. We also take advantage of citations of reference papers to identify Cited Text Spans (CTS) from reference papers. This approach provides us with a perspective of annotating the importance of the reference papers from the academic community. With QueryTopicSum and the identified CTS as candidate sentences, an optimization framework based on minimizing KL divergence is exerted to select the most representative sentences for related work generation. Our evaluation results on a set of 50 scientific papers along with their corresponding reference papers show that ToC-RWG achieves a considerable improvement over generic multi-document summarization and scientific summarization baselines.
- Subjects :
- Topic model
Information retrieval
General Computer Science
Computer science
Perspective (graphical)
cited text spans
General Engineering
Statistical model
Automatic summarization
Set (abstract data type)
scientific summarization
General Materials Science
lcsh:Electrical engineering. Electronics. Nuclear engineering
Citation
topic model
lcsh:TK1-9971
Generative grammar
Automatic related work generation
Subjects
Details
- Language :
- English
- ISSN :
- 21693536
- Volume :
- 8
- Database :
- OpenAIRE
- Journal :
- IEEE Access
- Accession number :
- edsair.doi.dedup.....f8c1787673189fe10bdd6b2694ce295e