GA-SRN: graph attention based text-image semantic reasoning network for fine-grained image classification and retrieval.
- Source :
- Neural Computing & Applications. Dec 2022, Vol. 34 Issue 23, p21387-21401. 15p.
- Publication Year :
- 2022
Abstract
- This paper proposes a new fine-grained image classification (FGIC) network that enhances feature relationships at multiple stages. Since scene text was introduced into FGIC and retrieval, a basic architecture of local, global, and text feature encoders plus a classifier has become established. The proposed method retains these components and expands them into a five-module architecture. Specifically, positional encoding is incorporated into both the local and textual feature encoders so that the complementary information it carries contributes to the feature representation. In the local and textual feature encoders, intra-modal semantic relation reasoning is introduced for FGIC through a proposed General Feature Relation Enhancement (GFRE) module. GFRE is a feature reasoning module applicable to any two inputs, whether of the same modality or of distinct modalities. GFRE adopts graph attention, which represents and infers relationships among graph data. Moreover, the latest multi-modal reasoning module is improved by a proposed Multi-Head Multi-Modal Joint Semantic Reasoning module, which consists of cross-modal GFREs combined by multi-head fusion. Experimental results on multiple datasets verify the effectiveness of the proposed algorithm. [ABSTRACT FROM AUTHOR]
- Subjects :
- *IMAGE retrieval
*ARTIFICIAL neural networks
*GRAPH algorithms
*VIDEO coding
Details
- Language :
- English
- ISSN :
- 09410643
- Volume :
- 34
- Issue :
- 23
- Database :
- Academic Search Index
- Journal :
- Neural Computing & Applications
- Publication Type :
- Academic Journal
- Accession number :
- 160074208
- Full Text :
- https://doi.org/10.1007/s00521-022-07617-3