Back to Search Start Over

Vision-Driven 2D Supervised Fine-Tuning Framework for Bird's Eye View Perception

Authors :
He, Lei
Wang, Qiaoyi
Sun, Honglin
Xu, Qing
Gao, Bolin
Li, Shengbo Eben
Wang, Jianqiang
Li, Keqiang
Publication Year :
2024

Abstract

Visual bird's eye view (BEV) perception, due to its excellent perceptual capabilities, is progressively replacing costly LiDAR-based perception systems, especially in the realm of urban intelligent driving. However, this type of perception still relies on LiDAR data to construct ground truth databases, a process that is both cumbersome and time-consuming. Moreover, most massproduced autonomous driving systems are only equipped with surround camera sensors and lack LiDAR data for precise annotation. To tackle this challenge, we propose a fine-tuning method for BEV perception network based on visual 2D semantic perception, aimed at enhancing the model's generalization capabilities in new scene data. Considering the maturity and development of 2D perception technologies, our method significantly reduces the dependency on high-cost BEV ground truths and shows promising industrial application prospects. Extensive experiments and comparative analyses conducted on the nuScenes and Waymo public datasets demonstrate the effectiveness of our proposed method.

Details

Database :
arXiv
Publication Type :
Report
Accession number :
edsarx.2409.05834
Document Type :
Working Paper