Back to Search Start Over

Learning content-aware feature fusion for guided depth map super-resolution.

Authors :
Zuo, Yifan
Wang, Hao
Xu, Yaping
Huang, Huimin
Huang, Xiaoshui
Xia, Xue
Fang, Yuming
Source :
Signal Processing: Image Communication. Aug2024, Vol. 126, pN.PAG-N.PAG. 1p.
Publication Year :
2024

Abstract

RGB-D data including paired RGB color images and depth maps is widely used in downstream computer vision tasks. However, compared with the acquisition of high-resolution color images, the depth maps captured by consumer-level sensors are always in low resolution. Within decades of research, the most state-of-the-art (SOTA) methods of depth map super-resolution cannot adaptively tune the guidance fusion for all feature positions by channel-wise feature concatenation with spatially sharing convolutional kernels. This paper proposes JTFNet to resolve this issue, which simulates the traditional Joint Trilateral Filter (JTF). Specifically, a novel JTF block is introduced to adaptively tune the fusion pattern between the color features and the depth features for all feature positions. Moreover, based on the variant of JTF block whose target features and guidance features are in the cross-scale shape, the fusion for depth features is performed in a bi-directional way. Therefore, the error accumulation along scales can be effectively mitigated by iteratively HR feature guidance. Compared with the SOTA methods, the sufficient experiment is conducted on the mainstream synthetic datasets and real datasets, i.e., Middlebury, NYU and ToF-Mark, which shows remarkable improvement of our JTFNet. • The Joint Trilateral Filter (JTF) block is proposed to adaptively tune the effect of guidance features. • We design two light subnetworks to learn kernel generation for color and depth features. • We propose Bidirectional Fusion blocks to fuse cross-scale depth feature based on the JTF block. • Our results on Middlebury, NYU and ToF-Mark shows the remarkable improvement. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
09235965
Volume :
126
Database :
Academic Search Index
Journal :
Signal Processing: Image Communication
Publication Type :
Academic Journal
Accession number :
177846585
Full Text :
https://doi.org/10.1016/j.image.2024.117140