Random Indexing and Modified Random Indexing based approach for extractive text summarization.

Authors :
Chatterjee, Niladri
Sahoo, Pramod Kumar
Source :
Computer Speech & Language. Jan 2015, Vol. 29 Issue 1, p32-44. 13p.
Publication Year :
2015

Abstract

Random Indexing based extractive text summarization has already been proposed in the literature. This paper examines that technique in detail and proposes several improvements. The improvements concern both the formation of the index (word) vectors of the document and the construction of context vectors, which uses convolution instead of addition on the index vectors. Experiments have been conducted using both angular and linear distances as proximity metrics. As a result, three improved versions of the algorithm, viz. RISUM, RISUM+ and MRISUM, were obtained. These algorithms have been applied to DUC 2002 documents, and their comparative performance has been studied. Different ROUGE metrics have been used for performance evaluation. While RISUM and RISUM+ perform almost at par, MRISUM is found to outperform both RISUM and RISUM+ significantly. MRISUM also outperforms the LSA+TRM based summarization approach. The study reveals that all three Random Indexing based techniques proposed in this study produce consistent results when linear distance is used for measuring proximity. [ABSTRACT FROM AUTHOR]
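
The abstract contrasts two ways of building context vectors (addition versus convolution of index vectors) and two proximity measures (angular versus linear distance). The Python sketch below illustrates those ideas in general terms only; it is not the RISUM, RISUM+ or MRISUM algorithm from the paper. The sparse ternary index vectors, the sliding `window`, the use of circular convolution, and the reading of "linear distance" as Euclidean distance are all assumptions made for illustration, and every function name is hypothetical.

```python
import numpy as np


def index_vector(dim, nonzeros, rng):
    """Sparse ternary index vector: half the chosen entries +1, half -1 (assumed scheme)."""
    vec = np.zeros(dim)
    positions = rng.choice(dim, size=nonzeros, replace=False)
    vec[positions[: nonzeros // 2]] = 1.0
    vec[positions[nonzeros // 2:]] = -1.0
    return vec


def circular_convolution(a, b):
    """Circular convolution of two equal-length vectors, computed via the FFT."""
    return np.real(np.fft.ifft(np.fft.fft(a) * np.fft.fft(b)))


def context_vectors(tokens, index, window=2, combine="add"):
    """Accumulate, for each word, a combination of its neighbours' index vectors.

    combine="add" sums the neighbours' index vectors; any other value
    chains them with circular convolution (an assumed interpretation).
    """
    dim = len(next(iter(index.values())))
    context = {word: np.zeros(dim) for word in index}
    for i, word in enumerate(tokens):
        lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
        neighbours = [tokens[j] for j in range(lo, hi) if j != i]
        if not neighbours:
            continue
        update = index[neighbours[0]].copy()
        for other in neighbours[1:]:
            if combine == "add":
                update = update + index[other]
            else:
                update = circular_convolution(update, index[other])
        context[word] += update
    return context


def angular_distance(a, b):
    """Angle in radians between two non-zero vectors."""
    cos = a @ b / (np.linalg.norm(a) * np.linalg.norm(b))
    return np.arccos(np.clip(cos, -1.0, 1.0))


def linear_distance(a, b):
    """Euclidean distance, assumed here as the meaning of 'linear distance'."""
    return np.linalg.norm(a - b)
```

Angular distance ignores vector length, whereas Euclidean distance does not, so the two measures can rank the same pair of vectors differently. That is one plausible reason for comparing both, although the abstract does not state the authors' motivation.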

Details

Language :
English
ISSN :
0885-2308
Volume :
29
Issue :
1
Database :
Academic Search Index
Journal :
Computer Speech & Language
Publication Type :
Academic Journal
Accession number :
99211955
Full Text :
https://doi.org/10.1016/j.csl.2014.07.001