Author: "Li, Hongkang" / Language: english - Searchworks@Jio Institute Digital Library Search Results

1. A Theoretical Understanding of Shallow Vision Transformers: Learning, Generalization, and Sample Complexity

Author: Li, Hongkang, Wang, Meng, Liu, Sijia, and Chen, Pin-yu
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Statistics - Machine Learning, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, Machine Learning (stat.ML), Machine Learning (cs.LG)
Abstract: Vision Transformers (ViTs) with self-attention modules have recently achieved great empirical success in many vision tasks. Due to non-convex interactions across layers, however, theoretical learning and generalization analysis is mostly elusive. Based on a data model characterizing both label-relevant and label-irrelevant tokens, this paper provides the first theoretical analysis of training a shallow ViT, i.e., one self-attention layer followed by a two-layer perceptron, for a classification task. We characterize the sample complexity to achieve a zero generalization error. Our sample complexity bound is positively correlated with the inverse of the fraction of label-relevant tokens, the token noise level, and the initial model error. We also prove that a training process using stochastic gradient descent (SGD) leads to a sparse attention map, which is a formal verification of the general intuition about the success of attention. Moreover, this paper indicates that a proper token sparsification can improve the test performance by removing label-irrelevant and/or noisy tokens, including spurious correlations. Empirical experiments on synthetic data and CIFAR-10 dataset justify our theoretical results and generalize to deeper ViTs.
Published: 2023

2. Colorimetric Aerogel Gas Sensor with High Sensitivity and Stability.

Author: Xia, Xiaoli, Wu, Ruonan, Zhang, Lei, Chen, Xiangyu, Yan, Yanling, Yin, Jikun, Ren, Jin, Li, Hongkang, Yin, Jinzhong, Xue, Zhenjie, Yi, Lanhua, and Wang, Tie
Published: 2023
Full Text: View/download PDF

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

2 results on '"Li, Hongkang"'

1. A Theoretical Understanding of Shallow Vision Transformers: Learning, Generalization, and Sample Complexity

2. Colorimetric Aerogel Gas Sensor with High Sensitivity and Stability.

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Publication Type

Database

2 results on '"Li, Hongkang"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources