Back to Search Start Over

Balancing stability and plasticity when learning topic models from short and noisy text streams.

Authors :
Nguyen, Tung
Mai, Trung
Nguyen, Nam
Van, Linh Ngo
Than, Khoat
Source :
Neurocomputing. Sep2022, Vol. 505, p30-43. 14p.
Publication Year :
2022

Abstract

Learning hidden topics from short text streams has become crucial in modern applications such as social networks, instant messages, question and answer forums, etc. Building an effective learning method poses two main challenges: Short and noisy data, as well as the stability-plasticity dilemma. In this paper, we investigate carefully how existing methods face these challenges. From our theoretical and empirical analyses, they often deal well with a challenge but ineffectively handle the other. In particular, they often suffer from catastrophic forgetting, because they impose a constraint on the learned knowledge from only the previous time step in the streaming duration. In this paper, we propose a novel method, namely BSP, which has a regularization term based on second-order Taylor expansion to accumulate information from all former minibatches. Moreover, external knowledge and Dropout technique will be combined at each time step to handle better short and noisy texts as well as enhance the model's plasticity. We empirically evaluate BSP, compared to other state-of-the-art streaming methods in terms of dealing with stability-plasticity dilemma and handling short and noisy texts. The extensive experiments show superior effectiveness of BSP. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
09252312
Volume :
505
Database :
Academic Search Index
Journal :
Neurocomputing
Publication Type :
Academic Journal
Accession number :
158481274
Full Text :
https://doi.org/10.1016/j.neucom.2022.07.019