Back to Search
Start Over
GMFAD: Towards Generalized Visual Recognition via Multilayer Feature Alignment and Disentanglement
- Source :
- IEEE Transactions on Pattern Analysis and Machine Intelligence. 44:1289-1303
- Publication Year :
- 2022
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2022.
-
Abstract
- The deep learning based approaches which have been repeatedly proven to bring benefits to visual recognition tasks usually make a strong assumption that the training and test data are drawn from similar feature spaces and distributions. However, such an assumption may not always hold in various practical application scenarios on visual recognition tasks. Inspired by the hierarchical organization of deep feature representation that progressively leads to more abstract features at higher layers of representations, we propose to tackle this problem with a novel feature learning framework, which is called GMFAD, with better generalization capability in a multilayer perceptron manner. We first learn feature representations at the shallow layer where shareable underlying factors among domains (e.g., a subset of which could be relevant for each particular domain) can be explored. In particular, we propose to align the domain divergence between domain pair(s) by considering both inter-dimension and inter-sample correlations, which have been largely ignored by many cross-domain visual recognition methods. Subsequently, to learn more abstract information which could further benefit transferability, we propose to conduct feature disentanglement at the deep feature layer. Extensive experiments based on different visual recognition tasks demonstrate that our proposed framework can learn better transferable feature representation compared with state-of-the-art baselines. Nanyang Technological University This research was supported in part by the Wallenberg-NTU Presidential Postdoctoral Fellowship, the NTU-PKU Joint Research Institute, a collaboration between the Nanyang Technological University and Peking University that is sponsored by a donation from the Ng Teng Fong Charitable Foundation, the Science and Technology Foundation of Guangzhou Huangpu Development District under Grant 2017GH22 and 201902 010028, and Sino-Singapore International Joint Research Institute (Project No. 206-A017023 and 206-A018001).
- Subjects :
- Computer science
Generalization
02 engineering and technology
Machine learning
computer.software_genre
Domain (software engineering)
Artificial Intelligence
0202 electrical engineering, electronic engineering, information engineering
Feature (machine learning)
Representation (mathematics)
Divergence (statistics)
Generalization Capability
business.industry
Applied Mathematics
Perceptron
Computational Theory and Mathematics
Electrical and electronic engineering [Engineering]
Computer science and engineering [Engineering]
Covariance Matrix
020201 artificial intelligence & image processing
Computer Vision and Pattern Recognition
Artificial intelligence
business
Feature learning
computer
Software
Subjects
Details
- ISSN :
- 19393539 and 01628828
- Volume :
- 44
- Database :
- OpenAIRE
- Journal :
- IEEE Transactions on Pattern Analysis and Machine Intelligence
- Accession number :
- edsair.doi.dedup.....1d01fea8596d183ae8294c102831c727