Back to Search
Start Over
YOLOPX: Anchor-free multi-task learning network for panoptic driving perception.
- Source :
-
Pattern Recognition . Apr2024, Vol. 148, pN.PAG-N.PAG. 1p. - Publication Year :
- 2024
-
Abstract
- Panoptic driving perception encompasses traffic object detection, drivable area segmentation, and lane detection. Existing methods typically utilize anchor-based multi-task learning networks to complete this task. While these methods yield promising results, they suffer from the inherent limitations of anchor-based detectors. In this paper, we propose YOLOPX, a simple and efficient anchor-free multi-task learning network for panoptic driving perception. To the best of our knowledge, this is the first work to employ the anchor-free detection head in panoptic driving perception. This anchor-free manner simplifies training by avoiding anchor-related heuristic tuning, and enhances the adaptability and scalability of our multi-task learning network. In addition, YOLOPX incorporates a novel lane detection head that combines multi-scale high-resolution features and long-distance contextual dependencies to improve segmentation performance. Beyond structure optimization, we propose optimization improvements to enhance network training, enabling our multi-task learning network to achieve optimal performance through simple end-to-end training. Experimental results on the challenging BDD100K dataset demonstrate the state-of-the-art (SOTA) performance of YOLOPX: it achieves 93.7% recall and 83.3% mAP50 on traffic object detection, 93.2% mIoU on drivable area segmentation, and 88.6% accuracy and 27.2% IoU on lane detection. Moreover, YOLOPX has faster inference speed compared to the lightweight network YOLOP. Consequently, YOLOPX is a powerful solution for panoptic driving perception problems. The code is available at https://github.com/jiaoZ7688/YOLOPX. • A novel anchor-free multi-task learning network for panoptic driving perception. • Lane detection utilizing multi-scale high-resolution features and long-distance contextual dependencies. • Several optimization improvements for efficient end-to-end training. • Extensive experiments demonstrate the effectiveness of YOLOPX by achieving excellent performance. [ABSTRACT FROM AUTHOR]
- Subjects :
- *TRAFFIC monitoring
*IMAGE segmentation
Subjects
Details
- Language :
- English
- ISSN :
- 00313203
- Volume :
- 148
- Database :
- Academic Search Index
- Journal :
- Pattern Recognition
- Publication Type :
- Academic Journal
- Accession number :
- 174791776
- Full Text :
- https://doi.org/10.1016/j.patcog.2023.110152