Back to Search Start Over

Displacement-Invariant Cost Computation for Stereo Matching.

Authors :
Zhong, Yiran
Loop, Charles
Byeon, Wonmin
Birchfield, Stan
Dai, Yuchao
Zhang, Kaihao
Kamenev, Alexey
Breuel, Thomas
Li, Hongdong
Kautz, Jan
Source :
International Journal of Computer Vision. May2022, Vol. 130 Issue 5, p1196-1209. 14p.
Publication Year :
2022

Abstract

Although deep learning-based methods have dominated stereo matching leaderboards by yielding unprecedented disparity accuracy, their inference time is typically slow, i.e., less than 4 FPS for a pair of 540p images. The main reason is that the leading methods employ time-consuming 3D convolutions applied to a 4D feature volume. A common way to speed up the computation is to downsample the feature volume, but this loses high-frequency details. To overcome these challenges, we propose a displacement-invariant cost computation module to compute the matching costs without needing a 4D feature volume. Rather, costs are computed by applying the same 2D convolution network on each disparity-shifted feature map pair independently. Unlike previous 2D convolution-based methods that simply perform context mapping between inputs and disparity maps, our proposed approach learns to match features between the two images. We also propose an entropy-based refinement strategy to refine the computed disparity map, which further improves the speed by avoiding the need to compute a second disparity map on the right image. Extensive experiments on standard datasets (SceneFlow, KITTI, ETH3D, and Middlebury) demonstrate that our method achieves competitive accuracy with much less inference time. On typical image sizes (e.g., 540 × 960 ), our method processes over 100 FPS on a desktop GPU, making our method suitable for time-critical applications such as autonomous driving. We also show that our approach generalizes well to unseen datasets, outperforming 4D-volumetric methods. We will release the source code to ensure the reproducibility. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
09205691
Volume :
130
Issue :
5
Database :
Academic Search Index
Journal :
International Journal of Computer Vision
Publication Type :
Academic Journal
Accession number :
156548740
Full Text :
https://doi.org/10.1007/s11263-022-01595-8