Self-supervised Multi-level Face Model Learning for Monocular Reconstruction at over 250 Hz

Tewari, Ayush; Zollhöfer, Michael; Garrido, Pablo; Bernard, Florian; Kim, Hyeongwoo; Pérez, Patrick; Theobalt, Christian

Released

Paper

MPS-Authors

Tewari, Ayush
Computer Graphics, MPI for Informatics, Max Planck Society

Zollhöfer, Michael
Computer Graphics, MPI for Informatics, Max Planck Society

Garrido, Pablo
Computer Graphics, MPI for Informatics, Max Planck Society

Bernard, Florian
Computer Graphics, MPI for Informatics, Max Planck Society

Kim, Hyeongwoo
Computer Graphics, MPI for Informatics, Max Planck Society

Theobalt, Christian
Computer Graphics, MPI for Informatics, Max Planck Society

Fulltext (public)

arXiv:1712.02859.pdf
(Preprint), 4MB

Citation

Tewari, A., Zollhöfer, M., Garrido, P., Bernard, F., Kim, H., Pérez, P., & Theobalt, C. (2017). Self-supervised Multi-level Face Model Learning for Monocular Reconstruction at over 250 Hz. Retrieved from http://arxiv.org/abs/1712.02859.

Cite as: https://hdl.handle.net/21.11116/0000-0000-615E-A

Abstract

The reconstruction of dense 3D models of face geometry and appearance from a single image is highly challenging and ill-posed. To constrain the problem, many approaches rely on strong priors, such as parametric face models learned from limited 3D scan data. However, prior models restrict generalization of the true diversity in facial geometry, skin reflectance and illumination. To alleviate this problem, we present the first approach that jointly learns 1) a regressor for face shape, expression, reflectance and illumination on the basis of 2) a concurrently learned parametric face model. Our multi-level face model combines the advantage of 3D Morphable Models for regularization with the out-of-space generalization of a learned corrective space. We train end-to-end on in-the-wild images without dense annotations by fusing a convolutional encoder with a differentiable expert-designed renderer and a self-supervised training loss, both defined at multiple detail levels. Our approach compares favorably to the state-of-the-art in terms of reconstruction quality, better generalizes to real world faces, and runs at over 250 Hz.
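
The multi-level idea in the abstract (a regularizing model level plus a corrective level that can leave the model's span) can be illustrated with a small Python sketch. This is not the authors' implementation. It assumes, for illustration only, that both levels are linear offset bases over flattened vertex coordinates and that both bases are non-empty. The names multilevel_shape, model_basis and corrective_basis are hypothetical, and the toy numbers are made up.

```python
from typing import List, Sequence, Tuple


def linear_combination(basis: Sequence[Sequence[float]],
                       coeffs: Sequence[float]) -> List[float]:
    """Return sum_k coeffs[k] * basis[k] for flattened offset vectors.

    Assumes a non-empty basis whose vectors all have the same length.
    """
    n = len(basis[0])
    out = [0.0] * n
    for vec, c in zip(basis, coeffs):
        for i in range(n):
            out[i] += c * vec[i]
    return out


def multilevel_shape(mean: Sequence[float],
                     model_basis: Sequence[Sequence[float]],
                     model_coeffs: Sequence[float],
                     corrective_basis: Sequence[Sequence[float]],
                     corrective_coeffs: Sequence[float]
                     ) -> Tuple[List[float], List[float]]:
    """Hypothetical two-level shape: (base level, base plus correction)."""
    # Base level: mean shape plus a combination of the prior model basis.
    model_offset = linear_combination(model_basis, model_coeffs)
    base = [m + d for m, d in zip(mean, model_offset)]
    # Final level: add a correction that need not lie in the model's span.
    correction = linear_combination(corrective_basis, corrective_coeffs)
    final = [b + d for b, d in zip(base, correction)]
    return base, final


# Toy example: two vertices stored as (x, y, z, x, y, z).
mean = [0.0, 0.0, 0.0, 1.0, 0.0, 0.0]
model_basis = [[0.0, 1.0, 0.0, 0.0, 1.0, 0.0]]       # moves both vertices in y
corrective_basis = [[0.0, 0.0, 0.5, 0.0, 0.0, 0.0]]  # moves one vertex in z
base, final = multilevel_shape(mean, model_basis, [2.0],
                               corrective_basis, [1.0])
print(base)
print(final)
```

In this toy case the corrective level moves the first vertex along z, a direction the single model component cannot produce. That is the sense of "out-of-space generalization" used here. The abstract states that the losses are likewise defined at multiple detail levels, which in this sketch would correspond to evaluating both base and final.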