Back to Search Start Over

Self-supervised monocular depth estimation with self-distillation and dense skip connection.

Authors :
Xiang, Xuezhi
Li, Wei
Wang, Yao
El Saddik, Abdulmotaleb
Source :
Computer Vision & Image Understanding; Sep2024, Vol. 246, pN.PAG-N.PAG, 1p
Publication Year :
2024

Abstract

Monocular depth estimation (MDE) is crucial in a wide range of applications, including robotics, autonomous driving and virtual reality. Self-supervised monocular depth estimation has emerged as a promising MDE approach without requiring hard-to-obtain depth labels during training, and multi-scale photometric loss is widely used for self-supervised monocular depth estimation as the self-supervised signal. However, multi-photometric loss is a weak training signal and might disturb the good intermediate features representation. In this paper, we propose a successive depth map self-distillation(SDM-SD) loss, which combines with the single-scale photometric loss to replace the multi-scale photometric loss. Moreover, considering that multi-stage feature representations are essential for dense prediction tasks such as depth estimation, we also propose a dense skip connection, which can efficiently fuse the intermediate features of the encoder and fully utilize them in each stage of the decoder in our encoder–decoder architecture. By applying successive depth map self-distillation loss and dense skip connection, our proposed method can achieve state-of-the-art performance on the KITTI benchmark, and exhibit the best generalization ability on the challenging indoor dataset NYUv2 dataset. • We propose a successive depth map self-distillation loss for self-supervised monocular depth estimation. • We propose a dense skip connection strategy to improve the depth estimation effect of the depth network. • We validate the proposed method's effectiveness on the KITTI dataset and NYUv2 dataset, achieving state-of-the-art performance. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
10773142
Volume :
246
Database :
Supplemental Index
Journal :
Computer Vision & Image Understanding
Publication Type :
Academic Journal
Accession number :
178478331
Full Text :
https://doi.org/10.1016/j.cviu.2024.104048