1. Zero-Shot Learning via Structure-Aligned Generative Adversarial Network
- Author
Jiancheng Lv, Zhenan He, Yunxia Li, and Chenwei Tang
- Subjects
Computer Networks and Communications, Computer science, Visual space, Machine learning, Minimax, Computer Science Applications, Domain (software engineering), Artificial Intelligence, Softmax function, Embedding, Artificial intelligence, Classifier (UML), Software, Semantic gap, Generator (mathematics)
- Abstract
In this article, we propose a structure-aligned generative adversarial network framework to improve zero-shot learning (ZSL) by mitigating the semantic gap, domain shift, and hubness problems. The proposed framework contains two parts: a generative adversarial network with a softmax classifier, and a structure-aligned module. In the first part, the generative adversarial network generates pseudovisual features by having the generator and the discriminator play a minimax two-player game. At the same time, the softmax classifier increases the interclass distance and reduces the intraclass distance, which mitigates the harmful effects of the domain shift and hubness problems. In the second part, we introduce a structure-aligned module that learns the structural consistency between the visual space and the semantic space. Aligning the structure of the two spaces bridges the semantic gap between them, and classification performance improves when the structure-aligned visual-semantic embedding space is transferred to the unseen classes. Using the pseudovisual features of unseen classes, our framework reformulates ZSL as a standard fully supervised classification task. Extensive experiments conducted on five benchmark data sets demonstrate that the proposed framework significantly outperforms state-of-the-art methods in both conventional and generalized settings.
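The abstract does not state how structural consistency between the two spaces is measured. As an illustration only, the sketch below assumes one common choice: compare the pairwise cosine-similarity matrix of per-class visual prototypes with that of the per-class semantic vectors and penalize the mean squared difference. The function names (`cosine`, `similarity_matrix`, `structure_alignment_loss`) and the loss itself are hypothetical and are not taken from the paper.

```python
import math


def cosine(u, v):
    """Cosine similarity between two non-zero vectors of equal length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def similarity_matrix(prototypes):
    """Class-by-class cosine similarities within one space."""
    return [[cosine(p, q) for q in prototypes] for p in prototypes]


def structure_alignment_loss(visual_protos, semantic_protos):
    """Mean squared difference between the two similarity matrices.

    Assumption (not from the paper): "structure" means the pattern of
    pairwise class similarities. The two spaces may have different
    dimensionality; only the number of classes must match.
    """
    sv = similarity_matrix(visual_protos)
    ss = similarity_matrix(semantic_protos)
    n = len(sv)
    total = sum((sv[i][j] - ss[i][j]) ** 2 for i in range(n) for j in range(n))
    return total / (n * n)


# Two classes that are orthogonal in both spaces: structures agree.
aligned = structure_alignment_loss([[1, 0], [0, 1]], [[2, 0, 0], [0, 3, 0]])

# Visual prototypes collapse onto each other while the semantic
# vectors stay orthogonal: structures disagree.
misaligned = structure_alignment_loss([[1, 0], [1, 0]], [[2, 0, 0], [0, 3, 0]])
```

Under this assumed measure, a loss near zero means classes that are close in the semantic space are also close in the visual space, which is the property the abstract relies on when transferring the embedding to unseen classes.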
- Published
- 2022