Back to Search
Start Over
Enhance the 3D Object Detection With 2D Prior
- Source :
- IEEE Access, Vol 12, Pp 67161-67169 (2024)
- Publication Year :
- 2024
- Publisher :
- IEEE, 2024.
-
Abstract
- In the modern field of multi-view 3D detection, interest has seen a notable rise. Existing methods are predominantly focused on constructing dense BEV (Bird’s Eye View) features or utilizing sparse queries for detection. In this paper, we synergize the methodologies of sparse queries with dense BEV feature representation and integrate robust 2D detection capabilities to propose an innovative structure. Specifically, we leverage the results of 2D detection and their predicted depth to generate a series of sparse queries endowed with strong semantic and positional priors. These queries demonstrate enhanced capability in capturing challenging cases in 3D detection. Additionally, we employ 2D detection and depth information to discern critical foreground BEV queries. This strategy enables a efficient and noiseless feature aggregation, leveraging multi-view image features. Our methodology extends to a comprehensive global modeling of all BEV queries, ensuring the derivation of premium quality BEV features. Furthermore, we enhance BEV features through temporal modeling that differentiates between static and dynamic objects. Our proposed approach achieves state-of-the-art results on the nuScenes test set with 60.4%mAP and 51.7%NDS. Code will be available.
Details
- Language :
- English
- ISSN :
- 21693536
- Volume :
- 12
- Database :
- Directory of Open Access Journals
- Journal :
- IEEE Access
- Publication Type :
- Academic Journal
- Accession number :
- edsdoj.06ade185f0354b1c9e32a92a7d8f7040
- Document Type :
- article
- Full Text :
- https://doi.org/10.1109/ACCESS.2024.3398373