1. Pose-Guided Multi-Scale Structural Relationship Learning for Video-Based Pedestrian Re-Identification
- Author
-
Dan Wei, Xiaoqiang Hu, Ziyang Wang, Jianglin Shen, and Hongjuan Ren
- Subjects
General Computer Science ,Computer science ,Image quality ,Feature extraction ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,02 engineering and technology ,relationship model ,Discriminative model ,0202 electrical engineering, electronic engineering, information engineering ,General Materials Science ,Spatial analysis ,business.industry ,Pedestrian re-identification ,Perspective (graphical) ,General Engineering ,020206 networking & telecommunications ,Pattern recognition ,multi-scale structure relationship ,Feature (computer vision) ,graph convolutional network ,Graph (abstract data type) ,020201 artificial intelligence & image processing ,lcsh:Electrical engineering. Electronics. Nuclear engineering ,Artificial intelligence ,Scale (map) ,business ,lcsh:TK1-9971 - Abstract
How to extract discriminative features from redundant video information is a key issue for video pedestrian re-identification. Factors such as occlusion, perspective, and posture changes in complex environments pose severe challenges to pedestrian re-identification based on local methods. In this paper, a posture-guided multi-scale structural relationship learning pedestrian re-identification method is proposed. The purpose is to analyze the video sequence of pedestrians based on the reference pose and the pose alignment model, and extract the sample frame with the highest image quality and the most complete spatial information in the reference pose. The method based on posture guidance can more accurately eliminate the interference of background, occlusion and perspective factors. To further explore the potential relationship between local regions, this paper calculates the relationship matrix between the local regions based on the relationship model to further calculate the relationship weight, and the graph convolutional network based on the relationship weight learns the structural relationship feature of multi-scale regions. The input of the graph convolutional network is a local region divided by a multi-scale method, and the output is a pose-guided multi-scale structural relationship feature. The experimental results on three public datasets show that the proposed method performs favorably against state-of-the-art methods.
- Published
- 2021