251. RDRF-Net: A pyramid architecture network with residual-based dynamic receptive fields for unsupervised depth estimation.
- Author
-
Ji, Zhen-yan, Song, Xiao-jun, Song, Hou-bin, Yang, Hong, and Guo, Xiao-xuan
- Subjects
- *
PYRAMIDS , *COMPUTER vision , *BOTTLENECKS (Manufacturing) , *CONVOLUTIONAL neural networks , *MARKOV random fields - Abstract
[Display omitted] • In this paper, we present a RDRF-Net for depth estimation with high accuracy. • It introduces a dynamic adaptive selection method of receptive fields for higher accuracy in depth estimation. • It introduces a residual architecture of 3 × 3 kernels instead of parallel 3 × 3 and 5 × 5 kernels to simulate two receptive fields, which leads to model compression and less network parameters. • It introduces a bottleneck architecture to further compress the network without explicit accuracy loss. Image depth estimation is a challenging problem in computer vision, especially considering both high accuracy and low run time. To save run time and maintain high accuracy, we present a new light-weight model in this paper, i.e., a Residual-based Dynamic Receptive Field Network (RDRF-Net). This model can automatically select the receptive fields suitable for different image scales to generate the depth maps with higher fitting degrees. Residual design and bottleneck layers are used to compress the network for reducing run time. Three groups of experiments are performed on the KITTI dataset to test the accuracy, computation time, and the impact of dynamic receptive fields. Experimental results show that RDRF-Net has comparable accuracy with Godard's model and significantly outperforms it in terms of run time. In addition, it performs closely to Pyd-Net in terms of run time and beats Pyd-Net's accuracy. Experiments also demonstrate the beneficial impact of dynamic receptive fields on improving depth estimation accuracy. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF