Back to Search Start Over

YOLOPX: Anchor-free multi-task learning network for panoptic driving perception.

Authors :
Zhan, Jiao
Luo, Yarong
Guo, Chi
Wu, Yejun
Meng, Jiawei
Liu, Jingnan
Source :
Pattern Recognition. Apr2024, Vol. 148, pN.PAG-N.PAG. 1p.
Publication Year :
2024

Abstract

Panoptic driving perception encompasses traffic object detection, drivable area segmentation, and lane detection. Existing methods typically utilize anchor-based multi-task learning networks to complete this task. While these methods yield promising results, they suffer from the inherent limitations of anchor-based detectors. In this paper, we propose YOLOPX, a simple and efficient anchor-free multi-task learning network for panoptic driving perception. To the best of our knowledge, this is the first work to employ the anchor-free detection head in panoptic driving perception. This anchor-free manner simplifies training by avoiding anchor-related heuristic tuning, and enhances the adaptability and scalability of our multi-task learning network. In addition, YOLOPX incorporates a novel lane detection head that combines multi-scale high-resolution features and long-distance contextual dependencies to improve segmentation performance. Beyond structure optimization, we propose optimization improvements to enhance network training, enabling our multi-task learning network to achieve optimal performance through simple end-to-end training. Experimental results on the challenging BDD100K dataset demonstrate the state-of-the-art (SOTA) performance of YOLOPX: it achieves 93.7% recall and 83.3% mAP50 on traffic object detection, 93.2% mIoU on drivable area segmentation, and 88.6% accuracy and 27.2% IoU on lane detection. Moreover, YOLOPX has faster inference speed compared to the lightweight network YOLOP. Consequently, YOLOPX is a powerful solution for panoptic driving perception problems. The code is available at https://github.com/jiaoZ7688/YOLOPX. • A novel anchor-free multi-task learning network for panoptic driving perception. • Lane detection utilizing multi-scale high-resolution features and long-distance contextual dependencies. • Several optimization improvements for efficient end-to-end training. • Extensive experiments demonstrate the effectiveness of YOLOPX by achieving excellent performance. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
00313203
Volume :
148
Database :
Academic Search Index
Journal :
Pattern Recognition
Publication Type :
Academic Journal
Accession number :
174791776
Full Text :
https://doi.org/10.1016/j.patcog.2023.110152