HandVoxNet: Deep Voxel-Based Network for 3D Hand Shape and Pose Estimation from 
a Single Depth Map

Malik, Jameel; Abdelaziz, Ibrahim; Elhayek, Ahmed; Shimada, Soshi; Ali, Sk Aziz; Golyanik, Vladislav; Theobalt, Christian; Stricker, Didier

DetailsSummary

HandVoxNet: Deep Voxel-Based Network for 3D Hand Shape and Pose Estimation from a Single Depth Map

Malik, J., Abdelaziz, I., Elhayek, A., Shimada, S., Ali, S. A., Golyanik, V., et al. (2020). HandVoxNet: Deep Voxel-Based Network for 3D Hand Shape and Pose Estimation from a Single Depth Map. Retrieved from https://arxiv.org/abs/2004.01588.

Item is Released

show all hide all

Basic

show hide

Item Permalink: https://hdl.handle.net/21.11116/0000-0007-E0FF-D Version Permalink: https://hdl.handle.net/21.11116/0000-0007-E100-A

Genre: Paper

Latex : {HandVoxNet}: {D}eep Voxel-Based Network for {3D} Hand Shape and Pose Estimation from a Single Depth Map

Files

show Files

hide Files

:

arXiv:2004.01588.pdf (Preprint), 3MB

View Save

File Permalink:
https://hdl.handle.net/21.11116/0000-0007-E101-9

Name:
arXiv:2004.01588.pdf

Description:
File downloaded from arXiv at 2021-02-03 11:20

OA-Status:

Visibility:
Public

MIME-Type / Checksum:
application/pdf / [MD5]

Technical Metadata:

View

Copyright Date:
-

Copyright Info:
-

License:
http://arxiv.org/licenses/nonexclusive-distrib/1.0/

Locators

show

Creators

show

hide

Creators:
Malik, Jameel¹, Author
Abdelaziz, Ibrahim¹, Author
Elhayek, Ahmed¹, Author
Shimada, Soshi², Author
Ali, Sk Aziz¹, Author
Golyanik, Vladislav², Author
Theobalt, Christian², Author
Stricker, Didier¹, Author

Affiliations:
1External Organizations, ou_persistent22
2Computer Graphics, MPI for Informatics, Max Planck Society, ou_40047

Content

show

hide

Free keywords: Computer Science, Computer Vision and Pattern Recognition, cs.CV

Abstract: 3D hand shape and pose estimation from a single depth map is a new and
challenging computer vision problem with many applications. The
state-of-the-art methods directly regress 3D hand meshes from 2D depth images
via 2D convolutional neural networks, which leads to artefacts in the
estimations due to perspective distortions in the images. In contrast, we
propose a novel architecture with 3D convolutions trained in a
weakly-supervised manner. The input to our method is a 3D voxelized depth map,
and we rely on two hand shape representations. The first one is the 3D
voxelized grid of the shape which is accurate but does not preserve the mesh
topology and the number of mesh vertices. The second representation is the 3D
hand surface which is less accurate but does not suffer from the limitations of
the first representation. We combine the advantages of these two
representations by registering the hand surface to the voxelized hand shape. In
the extensive experiments, the proposed approach improves over the state of the
art by 47.8% on the SynHand5M dataset. Moreover, our augmentation policy for
voxelized depth maps further enhances the accuracy of 3D hand pose estimation
on real data. Our method produces visually more reasonable and realistic hand
shapes on NYU and BigHand2.2M datasets compared to the existing approaches.

Details

show

hide

Language(s): eng - English

Dates: Created: 2020-04-03Published Online: 2020

Publication Status: Published online

Pages: 10 p.

Publishing info: -

Table of Contents: -

Rev. Type: -

Identifiers: arXiv: 2004.01588
URI: https://arxiv.org/abs/2004.01588
BibTex Citekey: Malik2004.01588

Degree: -

Event

show

Legal Case

show

Project information

show

Source

show