Back to Search
Start Over
Improving Coverage and Novelty of Abstractive Text Summarization Using Transfer Learning and Divide and Conquer Approaches.
- Source :
- Malaysian Journal of Computer Science; 2023, Vol. 36 Issue 3, p1-17, 17p
- Publication Year :
- 2023
-
Abstract
- Automatic Text Summarization (ATS) models yield outcomes with insufficient coverage of crucial details and poor degrees of novelty. The first issue resulted from the lengthy input, while the second problem resulted from the characteristics of the training dataset itself. This research employs the divide-and-conquer approach to address the first issue by breaking the lengthy input into smaller pieces to be summarized, followed by the conquest of the results in order to cover more significant details. For the second challenge, these chunks are summarized by models trained on datasets with higher novelty levels in order to produce more human-like and concise summaries with more novel words that do not appear in the input article. The results demonstrate an improvement in both coverage and novelty levels. Moreover, we defined a new metric to measure the novelty of the summary. Finally, the findings led us to conclude that the novelty levels are more significantly influenced by the training dataset itself, as in CNN/DM, than by other factors like the training model or its training objective, as in Pegasus. [ABSTRACT FROM AUTHOR]
- Subjects :
- TEXT summarization
Subjects
Details
- Language :
- English
- ISSN :
- 01279084
- Volume :
- 36
- Issue :
- 3
- Database :
- Supplemental Index
- Journal :
- Malaysian Journal of Computer Science
- Publication Type :
- Academic Journal
- Accession number :
- 173596078
- Full Text :
- https://doi.org/10.22452/mjcs.vol36no3.4