A survey of deep learning-based visual question answering

Authors :
Xue-jiao Yang
Yu-ling Yang
Tong-yuan Huang
Source :
Journal of Central South University. 28:728-746
Publication Year :
2021
Publisher :
Springer Science and Business Media LLC, 2021.

Abstract

With the rapid rise and continued development of machine learning, especially deep learning, research on visual question answering (VQA) has made significant progress and carries both theoretical significance and practical application value. It is therefore necessary to summarize current research and provide a reference for researchers in this field. This article presents a detailed, in-depth analysis and summary of the relevant research and typical methods in VQA. First, background knowledge about VQA is introduced. Second, the issues and challenges of VQA are discussed, together with a discussion of particular promising methodologies. Third, the key sub-problems affecting VQA are summarized and analyzed. Then, the commonly used datasets and evaluation metrics are summarized. Next, the popular algorithms and models in VQA research are compared and listed. Finally, future development trends in VQA are discussed and conclusions are drawn.

Details

ISSN :
2227-5223 and 2095-2899
Volume :
28
Database :
OpenAIRE
Journal :
Journal of Central South University
Accession number :
edsair.doi...........bd563c8f65d0b56b38b3900836d5e4db