Back to Search
Start Over
3DV: 3D Dynamic Voxel for Action Recognition in Depth Video
- Source :
- CVPR
- Publication Year :
- 2020
- Publisher :
- IEEE, 2020.
-
Abstract
- To facilitate depth-based 3D action recognition, 3D dynamic voxel (3DV) is proposed as a novel 3D motion representation. With 3D space voxelization, the key idea of 3DV is to encode 3D motion information within depth video into a regular voxel set (i.e., 3DV) compactly, via temporal rank pooling. Each available 3DV voxel intrinsically involves 3D spatial and motion feature jointly. 3DV is then abstracted as a point set and input into PointNet++ for 3D action recognition, in the end-to-end learning way. The intuition for transferring 3DV into the point set form is that, PointNet++ is lightweight and effective for deep feature learning towards point set. Since 3DV may lose appearance clue, a multi-stream 3D action recognition manner is also proposed to learn motion and appearance feature jointly. To extract richer temporal order information of actions, we also divide the depth video into temporal splits and encode this procedure in 3DV integrally. The extensive experiments on 4 well-established benchmark datasets demonstrate the superiority of our proposition. Impressively, we acquire the accuracy of 82.4% and 93.5% on NTU RGB+D 120 [13] with the cross-subject and crosssetup test setting respectively. 3DV's code is available at https://github.com/3huo/3DV-Action.<br />Comment: Accepted by CVPR2020
- Subjects :
- FOS: Computer and information sciences
business.industry
Computer science
Computer Vision and Pattern Recognition (cs.CV)
Computer Science - Computer Vision and Pattern Recognition
Pattern recognition
02 engineering and technology
Solid modeling
010501 environmental sciences
computer.software_genre
01 natural sciences
Voxel
0202 electrical engineering, electronic engineering, information engineering
RGB color model
Action recognition
020201 artificial intelligence & image processing
Artificial intelligence
business
computer
Feature learning
0105 earth and related environmental sciences
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
- Accession number :
- edsair.doi.dedup.....b94c9b8ca41554c2cd8ad1979c3356ac
- Full Text :
- https://doi.org/10.1109/cvpr42600.2020.00059