Back to Search Start Over

关键语义信息补足的深度文本聚类算法.

Authors :
郑璐依
黄瑞章
任丽娜
白瑞娜
林川
Source :
Application Research of Computers / Jisuanji Yingyong Yanjiu. Jun2023, Vol. 40 Issue 6, p1653-1659. 7p.
Publication Year :
2023

Abstract

The most existing deep text clustering methods only use traditional autoencoder to learn representation for clustering, and neglect the problems with over-reliance on raw data quality and loss of key semantic information during feature mapping. This paper proposed a deep document clustering method via key semantic information complementation (DCKSC) model. The DCKSC model firstly enriched the original text data by extracting keyword data. Secondly, this model designed a key semantic information complement module which used data enhancement representation to improve the traditional autoencoder, and compensated for the key semantic information lost in the mapping process. Finally, the algorithm synthesized the clustering loss and the reconstruction loss of the keyword semantic autoencoder, optimized the cluster label assignment and learned the presentation characteristics suitable for clustering. Experimental results show that DCKSC is superior to many mainstream deep document clustering algorithms. [ABSTRACT FROM AUTHOR]

Details

Language :
Chinese
ISSN :
10013695
Volume :
40
Issue :
6
Database :
Academic Search Index
Journal :
Application Research of Computers / Jisuanji Yingyong Yanjiu
Publication Type :
Academic Journal
Accession number :
169823943
Full Text :
https://doi.org/10.19734/j.issn.1001-3695.2022.10.0555