Back to Search
Start Over
CTHD-Net: CNN-Transformer hybrid dehazing network via residual global attention and gated boosting strategy.
- Source :
-
Journal of Visual Communication & Image Representation . Mar2024, Vol. 99, pN.PAG-N.PAG. 1p. - Publication Year :
- 2024
-
Abstract
- • A novel Residual Global Attention (RGA) to flexibly focus on different types of information, such as thin haze regions and thick haze regions. • A CNN-Transformer architecture to capture both detail texture information and global context information. • A Gated Strength-Operation-Subtract Boosting (GSOS) strategy to effectively integrate cross-layer feature information, thereby overcoming the problem of incomplete noise removal and detail loss. Single image dehazing is one of crucial tasks in the field of computer vision. However, existing methods are challenged on how to handle unevenly distributed haze, capture global contextual information, and filter noise while preserving details. To overcome these limitations, a novel dehazing network with residual global attention and gated boosting strategy based on a CNN-Transformer hybrid architecture (CTHD-Net) is proposed. Firstly, a feature encoder with a residual global attention (RGA) module is presented to improve the representation capability of the entire network by adaptively assigning different weights to feature maps. Subsequently, a CNN-Transformer hybrid architecture is designed to enhance the features encoding via the improved Swin-transformer and to capture the long-range dependencies among features by shifted-window Multi-head Self-attention. Finally, an effective Gated Strength-Operation-Subtract (GSOS) Boosting decoder is developed to reuse the key information required for image reconstruction in the shallow features, while effectively preventing haze noise. Extensive evaluation demonstrates that our proposed CTHD-Net significantly outperforms the previous state-of-the-arts in terms of quantity and quality. The source code has been made available at https://github.com/RC-Qiao/CTHD-Net. [ABSTRACT FROM AUTHOR]
- Subjects :
- *HAZE
*TEXTURE analysis (Image processing)
*COMPUTER vision
*NOISE
*IMAGE processing
Subjects
Details
- Language :
- English
- ISSN :
- 10473203
- Volume :
- 99
- Database :
- Academic Search Index
- Journal :
- Journal of Visual Communication & Image Representation
- Publication Type :
- Academic Journal
- Accession number :
- 175871424
- Full Text :
- https://doi.org/10.1016/j.jvcir.2024.104066