User Manual Privacy Policy Disclaimer Contact us
  Advanced SearchBrowse




Conference Paper

Monocular 3D Human Pose Estimation by Generation and Ordinal Ranking


Sharma,  Saurabh
Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society;

Fulltext (public)
There are no public fulltexts available
Supplementary Material (public)
There is no public supplementary material available

Sharma, S., Varigonda, P. T., Bindal, P., Sharma, A., & Jain, A. (in press). Monocular 3D Human Pose Estimation by Generation and Ordinal Ranking. In ICCV 2019. Piscataway, NJ: IEEE.

Cite as: http://hdl.handle.net/21.11116/0000-0004-59A8-B
Monocular 3D Human Pose Estimation from static images is a challenging problem, due to the curse of dimensionality and the ill-posed nature of lifting 2D to 3D. In this paper, we propose a Deep Conditional Variational Autoencoder based model that synthesizes diverse 3D pose samples conditioned on the estimated 2D pose. Our experiments reveal that the CVAE generates significantly diverse 3D samples that are consistent with the 2D pose, thereby reducing the ambiguity in lifting from 2D-to-3D. We use two strategies for predicting the final 3D pose - (a) depth-ordering/ordinal relations to score and aggregate the final 3D pose, or OrdinalScore, and (b) with supervision from an Oracle. We report close to state of the art results on two benchmark datasets using OrdinalScore, and state-of-the-art results using the Oracle. We also show our pipeline gives competitive results without paired 3D supervision. We shall make the training and evaluation code available at https://github.com/ssfootball04/generative_pose.