Back to Search Start Over

Heterogeneous heatmap distillation framework based on unbiased alignment for lightweight human pose estimation.

Authors :
Du, Congju
Li, Zhenyu
Zhao, Huijuan
He, Shuangjiang
Yu, Li
Source :
Image & Vision Computing. Jun2024, Vol. 146, pN.PAG-N.PAG. 1p.
Publication Year :
2024

Abstract

The growing demand for mobile devices has generated interest in lightweight human pose estimation. Currently, lightweight estimation generally uses heatmap-based methods, which has demonstrated exceptional performance. However, their use of non-differentiable post-processing imposes considerable inference latencies. Conversely, integral-based approaches expedite the inference process by employing a soft-argmax operation but compromise in accuracy. Integrating explicit heatmap knowledge learned using the heatmap-based method into the implicit heatmap generated by the integral-based method, thereby combining the best of both worlds, offers a promising avenue. However, owing to the disparities in supervision and inference processes, the explicit and implicit heatmaps are heterogeneous. Consequently, direct transfer of knowledge presents difficulties in ensuring consistencies in heat value and location. In this paper, we propose a novel Heterogeneous Heatmap Distillation (HHD) framework that effectively tackles these challenges. The framework seamlessly integrates explicit heatmap knowledge that contains high-precision localization information into implicit heatmaps. The framework revolves around an unbiased heatmap alignment scheme encompassing two steps: heterogeneous heatmap normalization and unbiased cropping. Heterogeneous heatmap normalization separately normalizes the output feature maps of both the teacher and student models, alleviating potential heat value bias during the knowledge transfer. Unbiased cropping applies closed-form computation on the normalized teacher and student heatmap to eliminate location bias. Additionally, mirror expansion is implemented to handle potential cases wherein the cropped region extends beyond the image boundary. Extensive experiments demonstrate the efficiency and effectiveness of our methods on the MSCOCO and MPII datasets compared to other integral-based lightweight networks. Our source codes and pre-trained models are available at https://github.com/ducongju/HHD. • We propose a heterogeneousheatmap distillation(HHD) forhuman pose estimation. • The HHDtransfers knowledge from explicit heatmaps to implicit heatmaps. • Heterogeneous heatmap normalization mitigatesthe heat value bias between heatmaps. • Unbiasedcropping and mirror expansion mitigatethe location bias between heatmaps. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
02628856
Volume :
146
Database :
Academic Search Index
Journal :
Image & Vision Computing
Publication Type :
Academic Journal
Accession number :
177372785
Full Text :
https://doi.org/10.1016/j.imavis.2024.105041