Back to Search Start Over

Haar wavelet downsampling: A simple but effective downsampling module for semantic segmentation.

Authors :
Xu, Guoping
Liao, Wentao
Zhang, Xuan
Li, Chang
He, Xinwei
Wu, Xinglong
Source :
Pattern Recognition. Nov2023, Vol. 143, pN.PAG-N.PAG. 1p.
Publication Year :
2023

Abstract

• We propose a novel Wavelet-based downsampling module (HWD) for CNNs. To the best of our knowledge, our method is the first attempt to explore feasibility by prohibiting (impeding) information loss in the downsampling stage of DCNNs for the semantic segmentation task. • We explore the measurement of information uncertainty across layers in CNNs, and propose a novel metric, named Feature Entropy Index (FEI), to evaluate the information uncertainty or feature importance between the downsampled feature maps and the prediction results. • The proposed HWD can be directly replaced the strided convolution or pooling layer without significant increase of computation overhead and be easily integrated into the current segmentation architectures. Comprehensive experiments demonstrate the effectiveness of the HWD module when comparing with seven state-of-the-art segmentation methods. Downsampling operations such as max pooling or strided convolution are ubiquitously utilized in Convolutional Neural Networks (CNNs) to aggregate local features, enlarge receptive field, and minimize computational overhead. However, for a semantic segmentation task, pooling features over the local neighbourhood may result in the loss of important spatial information, which is conducive for pixel-wise predictions. To address this issue, we introduce a simple yet effective pooling operation called the Haar Wavelet-based Downsampling (HWD) module. This module can be easily integrated into CNNs to enhance the performance of semantic segmentation models. The core idea of HWD is to apply Haar wavelet transform for reducing the spatial resolution of feature maps while preserving as much information as possible. Furthermore, to investigate the benefits of HWD, we propose a novel metric, named as feature entropy index (FEI), which measures the degree of information uncertainty after downsampling in CNNs. Specifically, the FEI can be used to indicate the ability of downsampling methods to preserve essential information in semantic segmentation. Our comprehensive experiments demonstrate that the proposed HWD module could (1) effectively improve the segmentation performance across different modality image datasets with various CNN architectures, and (2) efficiently reduce information uncertainty compared to the conventional downsampling methods. Our implementation are available at https://github.com/apple1986/HWD. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
00313203
Volume :
143
Database :
Academic Search Index
Journal :
Pattern Recognition
Publication Type :
Academic Journal
Accession number :
171109926
Full Text :
https://doi.org/10.1016/j.patcog.2023.109819