Author: "Zhai, Xiaohua" / Journal: 2022 ieee/cvf conference on computer vision and pattern recognition (cvpr) - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Zhai, Xiaohua"' showing total 3 results

Start Over Author "Zhai, Xiaohua" Journal 2022 ieee/cvf conference on computer vision and pattern recognition (cvpr)

3 results on '"Zhai, Xiaohua"'

1. Scaling Vision Transformers

Author: Zhai, Xiaohua, Kolesnikov, Alexander, Houlsby, Neil, and Beyer, Lucas
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, Machine Learning (cs.LG)
Abstract: Attention-based neural networks such as the Vision Transformer (ViT) have recently attained state-of-the-art results on many computer vision benchmarks. Scale is a primary ingredient in attaining excellent results, therefore, understanding a model's scaling properties is a key to designing future generations effectively. While the laws for scaling Transformer language models have been studied, it is unknown how Vision Transformers scale. To address this, we scale ViT models and data, both up and down, and characterize the relationships between error rate, data, and compute. Along the way, we refine the architecture and training of ViT, reducing memory consumption and increasing accuracy of the resulting models. As a result, we successfully train a ViT model with two billion parameters, which attains a new state-of-the-art on ImageNet of 90.45% top-1 accuracy. The model also performs well for few-shot transfer, for example, reaching 84.86% top-1 accuracy on ImageNet with only 10 examples per class., Comment: Xiaohua, Alex, and Lucas contributed equally; CVPR 2022
Published: 2022
Full Text: View/download PDF

2. LiT: Zero-Shot Transfer with Locked-image text Tuning

Author: Zhai, Xiaohua, Wang, Xiao, Mustafa, Basil, Steiner, Andreas, Keysers, Daniel, Kolesnikov, Alexander, and Beyer, Lucas
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Science - Computation and Language, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Computation and Language (cs.CL), Machine Learning (cs.LG)
Abstract: This paper presents contrastive-tuning, a simple method employing contrastive training to align image and text models while still taking advantage of their pre-training. In our empirical study we find that locked pre-trained image models with unlocked text models work best. We call this instance of contrastive-tuning "Locked-image Tuning" (LiT), which just teaches a text model to read out good representations from a pre-trained image model for new tasks. A LiT model gains the capability of zero-shot transfer to new vision tasks, such as image classification or retrieval. The proposed LiT is widely applicable; it works reliably with multiple pre-training methods (supervised and unsupervised) and across diverse architectures (ResNet, Vision Transformers and MLP-Mixer) using three different image-text datasets. With the transformer-based pre-trained ViT-g/14 model, the LiT model achieves 85.2% zero-shot transfer accuracy on the ImageNet test set, and 82.5% on the challenging out-of-distribution ObjectNet test set., Comment: Xiaohua, Xiao, Basil, Andreas and Lucas contributed equally; CVPR 2022
Published: 2022
Full Text: View/download PDF

3. Knowledge distillation: A good teacher is patient and consistent

Author: Beyer, Lucas, primary, Zhai, Xiaohua, additional, Royer, Amelie, additional, Markeeva, Larisa, additional, Anil, Rohan, additional, and Kolesnikov, Alexander, additional
Published: 2022
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

3 results on '"Zhai, Xiaohua"'

1. Scaling Vision Transformers

2. LiT: Zero-Shot Transfer with Locked-image text Tuning

3. Knowledge distillation: A good teacher is patient and consistent

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Database

3 results on '"Zhai, Xiaohua"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources