Back to Search Start Over

VKP-P3D: Real-Time Monocular Pseudo 3D Object Detection Based on Visible Key Points and Camera Geometry

Authors :
Changliang Sun
Hongli Liu
Weichu Xiao
Bo Shi
Yuan Qiu
Source :
IEEE Access, Vol 12, Pp 41883-41895 (2024)
Publication Year :
2024
Publisher :
IEEE, 2024.

Abstract

Three-dimensional object detection has been substantially improved with the use of expensive LiDAR and stereo vision systems in intelligent driving. The less-expensive and more scalable solution of monocular 3D object detection, however, remains a key challenge. This study primarily explores real-time pseudo 3D object detection with monocular vision and designs a single-shot RPN model, VKP-P3D, which relies purely on visual feature extraction. Through a multiscale feature fusion and an attention mechanism module, this model obtains high-dimensional feature representations during the feature extraction phase. In the detection head of the VKP-P3D model, the pseudo 3D object detection is obtained by regressing 2D bounding box and the visible key points within the image coordinate of the 3D box from the camera’s perspective. Finally, assuming flat ground and considering geometric parameters of the camera, the object’s 3D information can be extracted. To verify the effectiveness of the proposed algorithm, we constructed two pseudo 3D object detection datasets based on visible key points and compared with current state-of-the-art real-time object detector. Results showed that the proposed model has high detection accuracy and speed.

Details

Language :
English
ISSN :
21693536
Volume :
12
Database :
Directory of Open Access Journals
Journal :
IEEE Access
Publication Type :
Academic Journal
Accession number :
edsdoj.0117e9632d414e7fb3bf9aa11bbc7769
Document Type :
article
Full Text :
https://doi.org/10.1109/ACCESS.2024.3378105