Back to Search Start Over

Repmono: a lightweight self-supervised monocular depth estimation architecture for high-speed inference

Authors :
Guowei Zhang
Xincheng Tang
Li Wang
Huankang Cui
Teng Fei
Hulin Tang
Shangfeng Jiang
Source :
Complex & Intelligent Systems, Vol 10, Iss 6, Pp 7927-7941 (2024)
Publication Year :
2024
Publisher :
Springer, 2024.

Abstract

Abstract Self-supervised monocular depth estimation has always attracted attention because it does not require ground truth data. Designing a lightweight architecture capable of fast inference is crucial for deployment on mobile devices. The current network effectively integrates Convolutional Neural Networks (CNN) with Transformers, achieving significant improvements in accuracy. However, this advantage comes at the cost of an increase in model size and a significant reduction in inference speed. In this study, we propose a network named Repmono, which includes LCKT module with a large convolutional kernel and RepTM module based on the structural reparameterisation technique. With the combination of these two modules, our network achieves both local and global feature extraction with a smaller number of parameters and significantly enhances inference speed. Our network, with 2.31MB parameters, shows significant accuracy improvements over Monodepth2 in experiments on the KITTI dataset. With uniform input dimensions, our network’s inference speed is 53.7% faster than R-MSFM6, 60.1% faster than Monodepth2, and 81.1% faster than MonoVIT-small. Our code is available at https://github.com/txc320382/Repmono .

Details

Language :
English
ISSN :
21994536 and 21986053
Volume :
10
Issue :
6
Database :
Directory of Open Access Journals
Journal :
Complex & Intelligent Systems
Publication Type :
Academic Journal
Accession number :
edsdoj.0e7d2d375d754ea3b5cecb687d7df956
Document Type :
article
Full Text :
https://doi.org/10.1007/s40747-024-01575-0