English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  In the Wild Human Pose Estimation Using Explicit 2D Features and Intermediate 3D Representations

Habibie, I., Xu, W., Mehta, D., Pons-Moll, G., & Theobalt, C. (2019). In the Wild Human Pose Estimation Using Explicit 2D Features and Intermediate 3D Representations. Retrieved from http://arxiv.org/abs/1904.03289.

Item is

Files

show Files
hide Files
:
arXiv:1904.03289.pdf (Preprint), 4MB
Name:
arXiv:1904.03289.pdf
Description:
File downloaded from arXiv at 2019-07-05 11:25 Accepted to CVPR 2019
OA-Status:
Visibility:
Public
MIME-Type / Checksum:
application/pdf / [MD5]
Technical Metadata:
Copyright Date:
-
Copyright Info:
-

Locators

show

Creators

show
hide
 Creators:
Habibie, Ikhsanul1, Author           
Xu, Weipeng1, Author           
Mehta, Dushyant1, Author           
Pons-Moll, Gerard2, Author                 
Theobalt, Christian1, Author                 
Affiliations:
1Computer Graphics, MPI for Informatics, Max Planck Society, ou_40047              
2Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society, ou_persistent22              

Content

show
hide
Free keywords: Computer Science, Computer Vision and Pattern Recognition, cs.CV
 Abstract: Convolutional Neural Network based approaches for monocular 3D human pose
estimation usually require a large amount of training images with 3D pose
annotations. While it is feasible to provide 2D joint annotations for large
corpora of in-the-wild images with humans, providing accurate 3D annotations to
such in-the-wild corpora is hardly feasible in practice. Most existing 3D
labelled data sets are either synthetically created or feature in-studio
images. 3D pose estimation algorithms trained on such data often have limited
ability to generalize to real world scene diversity. We therefore propose a new
deep learning based method for monocular 3D human pose estimation that shows
high accuracy and generalizes better to in-the-wild scenes. It has a network
architecture that comprises a new disentangled hidden space encoding of
explicit 2D and 3D features, and uses supervision by a new learned projection
model from predicted 3D pose. Our algorithm can be jointly trained on image
data with 3D labels and image data with only 2D labels. It achieves
state-of-the-art accuracy on challenging in-the-wild data.

Details

show
hide
Language(s): eng - English
 Dates: 2019-04-052019
 Publication Status: Published online
 Pages: 15 p.
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: arXiv: 1904.03289
URI: http://arxiv.org/abs/1904.03289
BibTex Citekey: Habibie_arXiv1904.03289
 Degree: -

Event

show

Legal Case

show

Project information

show

Source

show