Back to Search
Start Over
Region ensemble network: Towards good practices for deep 3D hand pose estimation.
- Source :
-
Journal of Visual Communication & Image Representation . Aug2018, Vol. 55, p404-414. 11p. - Publication Year :
- 2018
-
Abstract
- Highlights • Propose a novel <bold> Region Ensemble Network </bold> for end-to-end hand pose estimation. • Present several good practices of ConvNets for hand pose estimation. • The Proposed method outperforms state-of-the-arts on hand and human pose datasets. 3D hand pose estimation is an important and challenging problem for human-computer interaction. Recently convolutional networks (ConvNet) with sophisticated design have been employed to address it, but the improvement is not so significant. To exploit good practice and promote the performance for hand pose estimation, we propose a Region Ensemble Network (REN) for directly 3D coordinate regression. It first partitions the last convolutional outputs of ConvNet into several grid regions. Results from separate fully-connected (FC) regressors on each regions are integrated by another FC layer to perform estimation. By exploitation of several training strategies including data augmentation and smooth L 1 loss, REN significantly improves the performance of ConvNet for hand pose estimation. Experiments demonstrate that our approach achieves strong performance on par or better than state-of-the-art algorithms on three public hand pose datasets. We also experiment our methods on fingertip detection and human pose datasets and obtain state-of-the-art accuracy. [ABSTRACT FROM AUTHOR]
Details
- Language :
- English
- ISSN :
- 10473203
- Volume :
- 55
- Database :
- Academic Search Index
- Journal :
- Journal of Visual Communication & Image Representation
- Publication Type :
- Academic Journal
- Accession number :
- 131628580
- Full Text :
- https://doi.org/10.1016/j.jvcir.2018.04.005