1. A survey of deep learning-based visual question answering
- Author
-
Xue-jiao Yang, Yu-ling Yang, and Tong-yuan Huang
- Subjects
Computer science ,business.industry ,Deep learning ,Metals and Alloys ,General Engineering ,Theoretical research ,02 engineering and technology ,010501 environmental sciences ,01 natural sciences ,Data science ,Field (computer science) ,Metallic materials ,0202 electrical engineering, electronic engineering, information engineering ,Key (cryptography) ,Question answering ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,0105 earth and related environmental sciences - Abstract
With the warming up and continuous development of machine learning, especially deep learning, the research on visual question answering field has made significant progress, with important theoretical research significance and practical application value. Therefore, it is necessary to summarize the current research and provide some reference for researchers in this field. This article conducted a detailed and in-depth analysis and summarized of relevant research and typical methods of visual question answering field. First, relevant background knowledge about VQA(Visual Question Answering) was introduced. Secondly, the issues and challenges of visual question answering were discussed, and at the same time, some promising discussion on the particular methodologies was given. Thirdly, the key sub-problems affecting visual question answering were summarized and analyzed. Then, the current commonly used data sets and evaluation indicators were summarized. Next, in view of the popular algorithms and models in VQA research, comparison of the algorithms and models was summarized and listed. Finally, the future development trend and conclusion of visual question answering were prospected.
- Published
- 2021