Start Over

YOLOPX: Anchor-free multi-task learning network for panoptic driving perception.

Authors :: Zhan, Jiao
Luo, Yarong
Guo, Chi
Wu, Yejun
Meng, Jiawei
Liu, Jingnan
Source :: Pattern Recognition. Apr2024, Vol. 148, pN.PAG-N.PAG. 1p.
Publication Year :: 2024
Abstract: Panoptic driving perception encompasses traffic object detection, drivable area segmentation, and lane detection. Existing methods typically utilize anchor-based multi-task learning networks to complete this task. While these methods yield promising results, they suffer from the inherent limitations of anchor-based detectors. In this paper, we propose YOLOPX, a simple and efficient anchor-free multi-task learning network for panoptic driving perception. To the best of our knowledge, this is the first work to employ the anchor-free detection head in panoptic driving perception. This anchor-free manner simplifies training by avoiding anchor-related heuristic tuning, and enhances the adaptability and scalability of our multi-task learning network. In addition, YOLOPX incorporates a novel lane detection head that combines multi-scale high-resolution features and long-distance contextual dependencies to improve segmentation performance. Beyond structure optimization, we propose optimization improvements to enhance network training, enabling our multi-task learning network to achieve optimal performance through simple end-to-end training. Experimental results on the challenging BDD100K dataset demonstrate the state-of-the-art (SOTA) performance of YOLOPX: it achieves 93.7% recall and 83.3% mAP50 on traffic object detection, 93.2% mIoU on drivable area segmentation, and 88.6% accuracy and 27.2% IoU on lane detection. Moreover, YOLOPX has faster inference speed compared to the lightweight network YOLOP. Consequently, YOLOPX is a powerful solution for panoptic driving perception problems. The code is available at https://github.com/jiaoZ7688/YOLOPX. • A novel anchor-free multi-task learning network for panoptic driving perception. • Lane detection utilizing multi-scale high-resolution features and long-distance contextual dependencies. • Several optimization improvements for efficient end-to-end training. • Extensive experiments demonstrate the effectiveness of YOLOPX by achieving excellent performance. [ABSTRACT FROM AUTHOR]

Subjects :: *TRAFFIC monitoring
*IMAGE segmentation

Details

Language :: English
ISSN :: 00313203
Volume :: 148
Database :: Academic Search Index
Journal :: Pattern Recognition
Publication Type :: Academic Journal
Accession number :: 174791776
Full Text :: https://doi.org/10.1016/j.patcog.2023.110152

Full Text Access

View/download PDF

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

YOLOPX: Anchor-free multi-task learning network for panoptic driving perception.

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

YOLOPX: Anchor-free multi-task learning network for panoptic driving perception.

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources