Kinematics-aware spatial-temporal feature transform for 3D human pose estimation.

Authors :: Du, Songlin
Yuan, Zhiwei
Ikenaga, Takeshi
Source :: Pattern Recognition. Jun2024, Vol. 150, pN.PAG-N.PAG. 1p.
Publication Year :: 2024
Abstract: 3D human pose estimation plays an important role in various human-machine interactive applications, but how to effectively extract and represent the kinematical features of human body structure in video has always been a challenge. This paper presents some inspiring observations on the human body properties that hold heuristic patterns of human poses: 1) There is distinct temporal coherence in any kind of human pose; 2) there exist evident spatial and temporal correlations among local joints even though the human is doing complex actions. According to the observed patterns, a locally structured feature encoder and a spatial–temporal feature transform are proposed for kinematics-aware feature extraction and enhancement. Unlike existing works directly projecting every bone joint to pose features without distinction, the proposed locally-structured feature encoder maps the local connection property of human body structure to kinematical features which are neural embeddings extracted from both local and global groups of human bone joints. Since the local and global bone-joint groups are pre-defined according to human body kinematics, the kinematical features are able to represent body kinematics. The kinematical features are then transformed by the proposed spatial–temporal feature transform to enhance the spatial and temporal correlations among human bone joints. The overall framework well promotes the representation of human body kinematics for 3D pose estimation. Extensive experimental results on commonly used datasets show that the mean per joint position error (MPJPE) is significantly reduced when compared with state-of-the-art methods under the same experimental condition. The improvement is expected to promote machines to better understand human poses for building superior human-centered automation systems.
• Spatial–temporal kinematic-awareness is studied for 3D human pose estimation.
• Hybrid-kinematical feature encoder extracts kinematical features of 2D pose.
• Spatial–temporal feature transform enhances the spatial and temporal correlations.
• The fusion of spatial and temporal features promote the final 3D pose estimation.
[ABSTRACT FROM AUTHOR]
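
The abstract reports results as mean per joint position error (MPJPE). As a minimal sketch of how that metric is commonly computed (the average Euclidean distance between corresponding predicted and ground-truth joints), the following may help. The function name `mpjpe`, the three-joint poses, and the millimetre units are illustrative assumptions, not taken from the paper, and the abstract does not state which alignment protocol the authors apply before measuring.

```python
import math

def mpjpe(predicted, target):
    """Mean per joint position error between two poses.

    Each pose is a sequence of (x, y, z) joint coordinates given in the
    same units and the same joint order; the result is in those units.
    """
    if len(predicted) != len(target) or not predicted:
        raise ValueError("poses must be non-empty with matching joint counts")
    # Average the Euclidean distance between corresponding joints.
    total = sum(math.dist(p, t) for p, t in zip(predicted, target))
    return total / len(predicted)

# Hypothetical 3-joint poses (coordinates assumed to be in millimetres).
pred = [(0.0, 0.0, 0.0), (3.0, 4.0, 0.0), (0.0, 0.0, 10.0)]
gt = [(0.0, 0.0, 0.0), (0.0, 0.0, 0.0), (0.0, 0.0, 4.0)]
error = mpjpe(pred, gt)  # per-joint distances are 0, 5 and 6
```

For a video sequence, the same quantity would typically be averaged over all frames as well as all joints.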

Subjects :: *POSE estimation (Computer vision)
*HUMAN kinematics
*JOINTS (Anatomy)
*HUMAN body
*FEATURE extraction
*HUMAN beings

Details

Language :: English
ISSN :: 00313203
Volume :: 150
Database :: Academic Search Index
Journal :: Pattern Recognition
Publication Type :: Academic Journal
Accession number :: 175963857
Full Text :: https://doi.org/10.1016/j.patcog.2024.110316
