Back to Search
Start Over
基于多路语义图网络的图像自动问答.
- Source :
-
Application Research of Computers / Jisuanji Yingyong Yanjiu . Feb2023, Vol. 40 Issue 2, p383-387. 5p. - Publication Year :
- 2023
-
Abstract
- Recently, image question answering based on the fusion of visual features and text features has become one of the hot research issues of automatic question answering. Most of the existing models are based on the attention mechanism to explore the relationship between the image and the question sentence, which ignores the correlation between the image area and the question words in the same mode and different views. To solve these problems, this paper proposed an image question answering model (MSGN) based on multi-view semantic graph network, which could mine the semantic correlation between images and questions from multiple views. Meanwhile, it used the graph neural network model to mine the fine-grained intra and inter-modal correlation between image regions and question words. It carried out extensive experiments on public data sets. The experimental results show that the image automatic question answering model based on multi-view semantic graph network can improve the performance of image question answering. [ABSTRACT FROM AUTHOR]
- Subjects :
- *PROBLEM solving
*MULTICASTING (Computer networks)
Subjects
Details
- Language :
- Chinese
- ISSN :
- 10013695
- Volume :
- 40
- Issue :
- 2
- Database :
- Academic Search Index
- Journal :
- Application Research of Computers / Jisuanji Yingyong Yanjiu
- Publication Type :
- Academic Journal
- Accession number :
- 162018054
- Full Text :
- https://doi.org/10.19734/j.issn.1001-3695.2022.06.0335