Using word embedding to detect keywords in texts modeled as complex networks.
- Source :
- Scientometrics; Jul2024, Vol. 129 Issue 7, p3599-3623, 25p
- Publication Year :
- 2024
Abstract
- Detecting keywords in texts is a task of paramount importance for many text mining applications. Graph-based techniques have been commonly used to automatically find the key concepts in texts. However, the integration of valuable information provided by embeddings to enrich the graph structure has not been widely used. In this context, this paper aims to address the following question: can the quality of extracted keywords from a co-occurrence network be enhanced by integrating embeddings to enrich the network structure? In the adopted model, texts are represented as co-occurrence networks, where nodes are words and edges are established either by contextual or semantical similarity. Two embedding approaches were used: Word2vec and Bidirectional Encoder Representations from Transformers (BERT). The results indicate that using virtual edges can effectively enhance the discriminative capacity of co-occurrence networks. The best performance was achieved by incorporating a limited proportion of virtual (embedding) edges. A comparison of the structural and dynamical network metrics demonstrated that the degree, PageRank, and accessibility metrics exhibited superior performance in the proposed model. [ABSTRACT FROM AUTHOR]
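The model described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the token list, the two-dimensional vectors standing in for Word2vec or BERT embeddings, the window size, the `fraction` parameter, and all function names are assumptions made for illustration. The sketch builds a co-occurrence network, adds a limited share of "virtual" edges between the most similar unlinked words, and ranks words by degree, one of the metrics the abstract reports as performing well.

```python
import math
from itertools import combinations


def cooccurrence_edges(tokens, window=1):
    """Link each word to the distinct words that follow it within `window` positions."""
    edges = set()
    for i, word in enumerate(tokens):
        for other in tokens[i + 1 : i + 1 + window]:
            if other != word:
                edges.add(frozenset((word, other)))
    return edges


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def virtual_edges(embeddings, existing, fraction=0.4):
    """Pick the most similar word pairs that are not already linked.

    Only the top `fraction` of candidate pairs is kept (an assumed
    selection rule; the source says only that a limited proportion
    of virtual edges worked best).
    """
    candidates = [
        (cosine(embeddings[a], embeddings[b]), frozenset((a, b)))
        for a, b in combinations(sorted(embeddings), 2)
        if frozenset((a, b)) not in existing
    ]
    candidates.sort(key=lambda item: item[0], reverse=True)
    keep = int(fraction * len(candidates))
    return {pair for _, pair in candidates[:keep]}


def degree_ranking(edges):
    """Rank words by degree (highest first), breaking ties alphabetically."""
    degree = {}
    for edge in edges:
        for word in edge:
            degree[word] = degree.get(word, 0) + 1
    return sorted(degree, key=lambda w: (-degree[w], w))


# Toy input: hypothetical tokens and hand-made 2-D "embeddings".
tokens = ["network", "keyword", "graph", "embedding", "keyword", "text"]
embeddings = {
    "network": (1.0, 0.0),
    "graph": (0.9, 0.1),
    "keyword": (0.5, 0.5),
    "embedding": (0.0, 1.0),
    "text": (0.1, 0.9),
}

real = cooccurrence_edges(tokens)
virtual = virtual_edges(embeddings, real)
ranking = degree_ranking(real | virtual)
print(ranking)
```

In practice the toy vectors would be replaced by vectors from a trained embedding model, and degree could be swapped for PageRank or another network metric; the proportion of virtual edges is the parameter to tune.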
Details
- Language :
- English
- ISSN :
- 01389130
- Volume :
- 129
- Issue :
- 7
- Database :
- Complementary Index
- Journal :
- Scientometrics
- Publication Type :
- Academic Journal
- Accession number :
- 178656518
- Full Text :
- https://doi.org/10.1007/s11192-024-05055-7