Back to Search Start Over

HDA-pose: a real-time 2D human pose estimation method based on modified YOLOv8.

Authors :
Dong, Chengang
Tang, Yuhao
Zhang, Liyan
Source :
Signal, Image & Video Processing; Sep2024, Vol. 18 Issue 8/9, p5823-5839, 17p
Publication Year :
2024

Abstract

2D human pose estimation aims to accurately regress the keypoints of human body from images or videos. However, it remains challenging due to the occlusion and intersection among multiple individuals and the difficulty of dealing with different body scales. In order to better tackle these issues, we propose a human pose estimation framework named HDA-Pose. By improving the real-time framework of YOLOv8, we achieve simultaneous regression of all individuals' keypoint locations in the image. Specifically, we propose the High-Grade Dual Attention (HDA) module to further enhance the focus of YOLOv8 on important features of individuals in the image. Additionally, we improve the original data augmentation strategy in YOLOv8 to better simulate cases where key points of individuals are occluded in the image. Lastly, we introduce a novel regression loss metric, Vertex Intersection over Union, to further enhance the effectiveness of the model in multi-person pose estimation. Our approach attains competitive results on multiple metrics of two open-source datasets, MS COCO 2017 and CrowdPose. Compared with the baseline model YOLOv8x-pose, HDA-Pose improves the average precision by 2.9% and 3.3% on the two datasets, respectively. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
18631703
Volume :
18
Issue :
8/9
Database :
Complementary Index
Journal :
Signal, Image & Video Processing
Publication Type :
Academic Journal
Accession number :
178679134
Full Text :
https://doi.org/10.1007/s11760-024-03274-2