Back to Search
Start Over
Fully Sparse Fusion for 3D Object Detection.
- Source :
-
IEEE transactions on pattern analysis and machine intelligence [IEEE Trans Pattern Anal Mach Intell] 2024 Nov; Vol. 46 (11), pp. 7217-7231. Date of Electronic Publication: 2024 Oct 03. - Publication Year :
- 2024
-
Abstract
- Currently prevalent multi-modal 3D detection methods rely on dense detectors that usually use dense Bird's-Eye-View (BEV) feature maps. However, the cost of such BEV feature maps is quadratic to the detection range, making it not scalable for long-range detection. Recently, LiDAR-only fully sparse architecture has been gaining attention for its high efficiency in long-range perception. In this paper, we study how to develop a multi-modal fully sparse detector. Specifically, our proposed detector integrates the well-studied 2D instance segmentation into the LiDAR side, which is parallel to the 3D instance segmentation part in the LiDAR-only baseline. The proposed instance-based fusion framework maintains full sparsity while overcoming the constraints associated with the LiDAR-only fully sparse detector. Our framework showcases state-of-the-art performance on the widely used nuScenes dataset, Waymo Open Dataset, and the long-range Argoverse 2 dataset. Notably, the inference speed of our proposed method under the long-range perception setting is 2.7× faster than that of other state-of-the-art multimodal 3D detection methods.
Details
- Language :
- English
- ISSN :
- 1939-3539
- Volume :
- 46
- Issue :
- 11
- Database :
- MEDLINE
- Journal :
- IEEE transactions on pattern analysis and machine intelligence
- Publication Type :
- Academic Journal
- Accession number :
- 38648139
- Full Text :
- https://doi.org/10.1109/TPAMI.2024.3392303