Back to Search
Start Over
基于SOM 聚类的微博话题发现.
- Source :
-
Application Research of Computers / Jisuanji Yingyong Yanjiu . Mar2018, Vol. 35 Issue 3, p372-679. 5p. - Publication Year :
- 2018
-
Abstract
- With the increase of microblog users, the information of microblog platform is updating frequently. This paper proposed microblog topics detection based on SOM clustering for the features of the microblog text data sparseness, new words and non-standard words. Firstly, it pretreated the short texts from the primitive text corpus, and extracted the features of the short texts by the word vector model which reduced the computational burden caused by the high vector dimension. In order to reduce the large amount of computation just to the high vector dimensions, this paper extracted the short text feature extraction by word vector model. Then, the topic clustering could be achieved by an improved SOM clustering. The algorithm improved the traditional texts clustering shortcoming. And the algorithm could find the topic effectively. Experimental results show that the algorithm' s comprehensive index F value is improved obviously than the traditional methods. [ABSTRACT FROM AUTHOR]
Details
- Language :
- Chinese
- ISSN :
- 10013695
- Volume :
- 35
- Issue :
- 3
- Database :
- Academic Search Index
- Journal :
- Application Research of Computers / Jisuanji Yingyong Yanjiu
- Publication Type :
- Academic Journal
- Accession number :
- 128302516
- Full Text :
- https://doi.org/10.3969/j.issn.1001-3695.2018.03.007