Back to Search Start Over

Keyphrase Extraction by Improving TextRank with an Integration of Word Embedding and Syntactic Information

Authors :
Feng Yukun
Luo Qi
Zhang Sheng
Xia Jingbo
Ma Xiaohang
Ding Ke
Gifu Daniela
Zhang Silan
Source :
Recent Advances in Computer Science and Communications. 14:2969-2975
Publication Year :
2021
Publisher :
Bentham Science Publishers Ltd., 2021.

Abstract

Background: As a known key phrase extraction algorithm, TextRank is an analogue of PageRank algorithm, which relied heavily on the statistics of term frequency in the manner of co-occurrence analysis. Objective: The frequency-based characteristic made it a neck-bottle for performance enhancement, and various improved TextRank algorithms were proposed in the recent years. Most of improvements incorporated semantic information into key phrase extraction algorithm and achieved improvement. Method: In this research, taking both syntactic and semantic information into consideration, we integrated syntactic tree algorithm and word embedding and put forward an algorithm of Word Embedding and Syntactic Information Algorithm (WESIA), which improved the accuracy of the TextRank algorithm. Results: By applying our method on a self-made test set and a public test set, the result implied that the proposed unsupervised key phrase extraction algorithm outperformed the other algorithms to some extent.

Details

ISSN :
26662558
Volume :
14
Database :
OpenAIRE
Journal :
Recent Advances in Computer Science and Communications
Accession number :
edsair.doi...........bd08e0511cbdc65fb49f1557b8f2bf5c