Back to Search
Start Over
Adversarial Multimodal Network for Movie Story Question Answering
- Source :
- IEEE Transactions on Multimedia. 23:1744-1756
- Publication Year :
- 2021
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2021.
-
Abstract
- Visual question answering by using information from multiple modalities has attracted more and more attention in recent years. However, it is a very challenging task, as the visual content and natural language have quite different statistical properties. In this work, we present a method called Adversarial Multimodal Network (AMN) to better understand video stories for question answering. In AMN, we propose to learn multimodal feature representations by finding a more coherent subspace for video clips and the corresponding texts (e.g., subtitles and questions) based on generative adversarial networks. Moreover, a self-attention mechanism is developed to enforce our newly introduced consistency constraint in order to preserve the self-correlation between the visual cues of the original video clips in the learned multimodal representations. Extensive experiments on the benchmark MovieQA and TVQA datasets show the effectiveness of our proposed AMN over other published state-of-the-art methods.
- Subjects :
- Computer science
business.industry
02 engineering and technology
computer.software_genre
Computer Science Applications
Visualization
Consistency (database systems)
Knowledge extraction
Signal Processing
0202 electrical engineering, electronic engineering, information engineering
Media Technology
Question answering
Task analysis
Feature (machine learning)
020201 artificial intelligence & image processing
Artificial intelligence
Electrical and Electronic Engineering
business
computer
Sensory cue
Natural language
Natural language processing
Subjects
Details
- ISSN :
- 19410077 and 15209210
- Volume :
- 23
- Database :
- OpenAIRE
- Journal :
- IEEE Transactions on Multimedia
- Accession number :
- edsair.doi...........d43f4312c17f546eede750b690d9fc0c