Back to Search
Start Over
Learning content-aware feature fusion for guided depth map super-resolution.
- Source :
-
Signal Processing: Image Communication . Aug2024, Vol. 126, pN.PAG-N.PAG. 1p. - Publication Year :
- 2024
-
Abstract
- RGB-D data including paired RGB color images and depth maps is widely used in downstream computer vision tasks. However, compared with the acquisition of high-resolution color images, the depth maps captured by consumer-level sensors are always in low resolution. Within decades of research, the most state-of-the-art (SOTA) methods of depth map super-resolution cannot adaptively tune the guidance fusion for all feature positions by channel-wise feature concatenation with spatially sharing convolutional kernels. This paper proposes JTFNet to resolve this issue, which simulates the traditional Joint Trilateral Filter (JTF). Specifically, a novel JTF block is introduced to adaptively tune the fusion pattern between the color features and the depth features for all feature positions. Moreover, based on the variant of JTF block whose target features and guidance features are in the cross-scale shape, the fusion for depth features is performed in a bi-directional way. Therefore, the error accumulation along scales can be effectively mitigated by iteratively HR feature guidance. Compared with the SOTA methods, the sufficient experiment is conducted on the mainstream synthetic datasets and real datasets, i.e., Middlebury, NYU and ToF-Mark, which shows remarkable improvement of our JTFNet. • The Joint Trilateral Filter (JTF) block is proposed to adaptively tune the effect of guidance features. • We design two light subnetworks to learn kernel generation for color and depth features. • We propose Bidirectional Fusion blocks to fuse cross-scale depth feature based on the JTF block. • Our results on Middlebury, NYU and ToF-Mark shows the remarkable improvement. [ABSTRACT FROM AUTHOR]
- Subjects :
- *CONVOLUTIONAL neural networks
*DEPTH maps (Digital image processing)
Subjects
Details
- Language :
- English
- ISSN :
- 09235965
- Volume :
- 126
- Database :
- Academic Search Index
- Journal :
- Signal Processing: Image Communication
- Publication Type :
- Academic Journal
- Accession number :
- 177846585
- Full Text :
- https://doi.org/10.1016/j.image.2024.117140