Back to Search Start Over

基于SOM 聚类的微博话题发现.

Authors :
宋莉娜
冯旭鹏
刘利军
黄青松
Source :
Application Research of Computers / Jisuanji Yingyong Yanjiu. Mar2018, Vol. 35 Issue 3, p372-679. 5p.
Publication Year :
2018

Abstract

With the increase of microblog users, the information of microblog platform is updating frequently. This paper proposed microblog topics detection based on SOM clustering for the features of the microblog text data sparseness, new words and non-standard words. Firstly, it pretreated the short texts from the primitive text corpus, and extracted the features of the short texts by the word vector model which reduced the computational burden caused by the high vector dimension. In order to reduce the large amount of computation just to the high vector dimensions, this paper extracted the short text feature extraction by word vector model. Then, the topic clustering could be achieved by an improved SOM clustering. The algorithm improved the traditional texts clustering shortcoming. And the algorithm could find the topic effectively. Experimental results show that the algorithm' s comprehensive index F value is improved obviously than the traditional methods. [ABSTRACT FROM AUTHOR]

Details

Language :
Chinese
ISSN :
10013695
Volume :
35
Issue :
3
Database :
Academic Search Index
Journal :
Application Research of Computers / Jisuanji Yingyong Yanjiu
Publication Type :
Academic Journal
Accession number :
128302516
Full Text :
https://doi.org/10.3969/j.issn.1001-3695.2018.03.007