Back to Search Start Over

Improving Coverage and Novelty of Abstractive Text Summarization Using Transfer Learning and Divide and Conquer Approaches.

Authors :
Alomari, Ayham
Idris, Norisma
Md Sabri, Aznul Qalid
Alsmadi, Izzat
Source :
Malaysian Journal of Computer Science; 2023, Vol. 36 Issue 3, p1-17, 17p
Publication Year :
2023

Abstract

Automatic Text Summarization (ATS) models yield outcomes with insufficient coverage of crucial details and poor degrees of novelty. The first issue resulted from the lengthy input, while the second problem resulted from the characteristics of the training dataset itself. This research employs the divide-and-conquer approach to address the first issue by breaking the lengthy input into smaller pieces to be summarized, followed by the conquest of the results in order to cover more significant details. For the second challenge, these chunks are summarized by models trained on datasets with higher novelty levels in order to produce more human-like and concise summaries with more novel words that do not appear in the input article. The results demonstrate an improvement in both coverage and novelty levels. Moreover, we defined a new metric to measure the novelty of the summary. Finally, the findings led us to conclude that the novelty levels are more significantly influenced by the training dataset itself, as in CNN/DM, than by other factors like the training model or its training objective, as in Pegasus. [ABSTRACT FROM AUTHOR]

Subjects

Subjects :
TEXT summarization

Details

Language :
English
ISSN :
01279084
Volume :
36
Issue :
3
Database :
Supplemental Index
Journal :
Malaysian Journal of Computer Science
Publication Type :
Academic Journal
Accession number :
173596078
Full Text :
https://doi.org/10.22452/mjcs.vol36no3.4