1. Toward an end-to-end implicit addressee modeling for dialogue disentanglement.
- Author
-
Gao, Jingsheng, Li, Zeyu, Xiang, Suncheng, Wang, Zhuowei, Liu, Ting, and Fu, Yuzhuo
- Subjects
TRAINING manuals ,ANNOTATIONS ,CONVERSATION ,CLASSIFICATION - Abstract
Multi-party conversations are a practical and challenging scenario with more than two sessions entangled with each other. Therefore, it is necessary to disentangle a whole conversation into several sessions to help listeners decide which session each utterance is part of to respond to it appropriately. This task is referred to as dialogue disentanglement. Most existing methods focus on message-pair modeling and clustering in two-step methods, which are sensitive to the noise classification pairs and result in poor clustering performance. To address this challenge, we propose a contrastive learning framework named IAM for end-to-end implicit addressee modeling. To be more specific, IAM makes utterances in different sessions mutually exclusive to identify the sessions of utterances better. Then a clustering method is adopted to generate predicted clustering labels. Moreover, to alleviate the lack of massive annotated data, we introduce a strategy to select pseudo samples for unsupervised training without manual annotations. Comprehensive experiments conducted on the Movie Dialogue and IRC datasets demonstrate that IAM achieves state-of-the-art in both supervised and unsupervised manners. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF