Back to Search
Start Over
A sequence to sequence model for dialogue generation with gated mixture of topics
- Source :
- Neurocomputing. 437:282-288
- Publication Year :
- 2021
- Publisher :
- Elsevier BV, 2021.
-
Abstract
- In this paper, we propose GMoT-Seq2Seq, a sequence to sequence (Seq2Seq) model with a gated mixture of topics (MoT) designed to utilize topic information to generate fluent and coherent responses. Seq2Seq model is good at capturing the local structure of word sequence which affects the fluency due to their sequential nature, but probably has difficulty to extract topic information from the utterance. In contrast, topic models are very capable of capturing global semantic information that has a direct impact on the coherence. Absorbing the advantages of both, the proposed GMoT-Seq2Seq model uses a Seq2Seq to capture the temporal dependencies, and an MoT layer to obtain the topic vector that provides global semantic dependencies in the conversation. The MoT layer can summarize the utterances into a proportion vector over several underlying topics. To balance the fluency and coherence, we utilize a topic gate to dynamically control the information from the inferred topic vector and the partially generated responses. Experiment results show that our proposed model outperforms the compared baselines, and can generate more fluent and coherent responses.
- Subjects :
- Topic model
0209 industrial biotechnology
Sequence
Computer science
business.industry
Cognitive Neuroscience
Contrast (statistics)
02 engineering and technology
Coherence (statistics)
computer.software_genre
Computer Science Applications
Fluency
020901 industrial engineering & automation
Artificial Intelligence
0202 electrical engineering, electronic engineering, information engineering
020201 artificial intelligence & image processing
Artificial intelligence
Layer (object-oriented design)
business
computer
Natural language processing
Word (computer architecture)
Utterance
Subjects
Details
- ISSN :
- 09252312
- Volume :
- 437
- Database :
- OpenAIRE
- Journal :
- Neurocomputing
- Accession number :
- edsair.doi...........5fd84f2647a1741c9f44edaf1ed596b4