1. Collaborative Training of Gans in Continuous and Discrete Spaces for Text Generation
- Author
-
Yanghoon Kim, Seungpil Won, Seunghyun Yoon, and Kyomin Jung
- Subjects
FOS: Computer and information sciences ,Computer Science - Machine Learning ,General Computer Science ,Computer science ,Process (engineering) ,Computer Science - Artificial Intelligence ,Maximum likelihood ,010501 environmental sciences ,text GAN ,01 natural sciences ,Machine Learning (cs.LG) ,0502 economics and business ,Reinforcement learning ,General Materials Science ,Adversarial training ,050207 economics ,Representation (mathematics) ,collaborative training ,0105 earth and related environmental sciences ,Computer Science - Computation and Language ,business.industry ,Discrete space ,05 social sciences ,General Engineering ,Sampling (statistics) ,Autoencoder ,Manifold ,Artificial Intelligence (cs.AI) ,Benchmark (computing) ,Artificial intelligence ,lcsh:Electrical engineering. Electronics. Nuclear engineering ,business ,Computation and Language (cs.CL) ,lcsh:TK1-9971 ,Natural language - Abstract
Applying generative adversarial networks (GANs) to text-related tasks is challenging due to the discrete nature of language. One line of research resolves this issue by employing reinforcement learning (RL) and optimizing the next-word sampling policy directly in a discrete action space. Such methods compute the rewards from complete sentences and avoid error accumulation due to exposure bias. Other approaches employ approximation techniques that map the text to continuous representation in order to circumvent the non-differentiable discrete process. Particularly, autoencoder-based methods effectively produce robust representations that can model complex discrete structures. In this article, we propose a novel text GAN architecture that promotes the collaborative training of the continuous-space and discrete-space methods. Our method employs an autoencoder to learn an implicit data manifold, providing a learning objective for adversarial training in a continuous space. Furthermore, the complete textual output is directly evaluated and updated via RL in a discrete space. The collaborative interplay between the two adversarial trainings effectively regularize the text representations in different spaces. The experimental results on three standard benchmark datasets show that our model substantially outperforms state-of-the-art text GANs with respect to quality, diversity, and global consistency.
- Published
- 2020