Back to Search
Start Over
Hybrid attention transformer with re-parameterized large kernel convolution for image super-resolution.
- Source :
-
Image & Vision Computing . Sep2024, Vol. 149, pN.PAG-N.PAG. 1p. - Publication Year :
- 2024
-
Abstract
- Single image super-resolution is a well-established low-level vision task that aims to reconstruct high-resolution images from low-resolution images. Methods based on Transformer have shown remarkable success and achieved outstanding performance in SISR tasks. While Transformer effectively models global information, it is less effective at capturing high frequencies such as stripes that primarily provide local information. Additionally, it has the potential to further enhance the capture of global information. To tackle this, we propose a novel Large Kernel Hybrid Attention Transformer using re-parameterization. It combines different kernel sizes and different steps re-parameterized convolution layers with Transformer to effectively capture global and local information to learn comprehensive features with low-frequency and high-frequency information. Moreover, in order to solve the problem of using batch normalization layer to introduce artifacts in SISR, we propose a new training strategy which is fusing convolution layer and batch normalization layer after certain training epochs. This strategy can enjoy the acceleration convergence effect of batch normalization layer in training and effectively eliminate the problem of artifacts in the inference stage. For re-parameterization of multiple parallel branch convolution layers, adopting this strategy can further reduce the amount of calculation of training. By coupling these core improvements, our LKHAT achieves state-of-the-art performance for single image super-resolution task. [Display omitted] • The proposed module takes into account both high-frequency and low-frequency information of the image • The proposed module is conducive to generating clear texture details • The training strategy we proposed helps to reduce artifacts [ABSTRACT FROM AUTHOR]
- Subjects :
- *HIGH resolution imaging
*PROBLEM solving
*TASK performance
*STRIPES
Subjects
Details
- Language :
- English
- ISSN :
- 02628856
- Volume :
- 149
- Database :
- Academic Search Index
- Journal :
- Image & Vision Computing
- Publication Type :
- Academic Journal
- Accession number :
- 179030466
- Full Text :
- https://doi.org/10.1016/j.imavis.2024.105162