MoFA: Model-based Deep Convolutional Face Autoencoder for Unsupervised 
Monocular Reconstruction

Tewari, Ayush; Zollhöfer, Michael; Kim, Hyeongwoo; Garrido, Pablo; Bernard, Florian; Pérez, Patrick; Theobalt, Christian

DetailsSummary

MoFA: Model-based Deep Convolutional Face Autoencoder for Unsupervised Monocular Reconstruction

Tewari, A., Zollhöfer, M., Kim, H., Garrido, P., Bernard, F., Pérez, P., et al. (2017). MoFA: Model-based Deep Convolutional Face Autoencoder for Unsupervised Monocular Reconstruction. Retrieved from http://arxiv.org/abs/1703.10580.

Item is Released

show all hide all

Basic

show hide

Item Permalink: https://hdl.handle.net/11858/00-001M-0000-002D-8BEA-9 Version Permalink: https://hdl.handle.net/11858/00-001M-0000-002D-8BEB-7

Genre: Paper

Latex : {MoFA}: Model-based Deep Convolutional Face Autoencoder for Unsupervised Monocular Reconstruction

Files

show Files

hide Files

:

arXiv:1703.10580.pdf (Preprint), 10MB

View Save

File Permalink:
https://hdl.handle.net/11858/00-001M-0000-002D-8BEC-5

Name:
arXiv:1703.10580.pdf

Description:
File downloaded from arXiv at 2017-07-05 13:23

OA-Status:

Visibility:
Public

MIME-Type / Checksum:
application/pdf / [MD5]

Technical Metadata:

View

Copyright Date:
-

Copyright Info:
-

License:
http://arxiv.org/help/license

Locators

show

Creators

show

hide

Creators:
Tewari, Ayush¹, Author
Zollhöfer, Michael¹, Author
Kim, Hyeongwoo¹, Author
Garrido, Pablo¹, Author
Bernard, Florian², Author
Pérez, Patrick², Author
Theobalt, Christian¹, Author

Affiliations:
1Computer Graphics, MPI for Informatics, Max Planck Society, ou_40047
2External Organizations, ou_persistent22

Content

show

hide

Free keywords: Computer Science, Computer Vision and Pattern Recognition, cs.CV

Abstract: In this work we propose a novel model-based deep convolutional autoencoder that addresses the highly challenging problem of reconstructing a 3D human face from a single in-the-wild color image. To this end, we combine a convolutional encoder network with an expert-designed generative model that serves as decoder. The core innovation is our new differentiable parametric decoder that encapsulates image formation analytically based on a generative model. Our decoder takes as input a code vector with exactly defined semantic meaning that encodes detailed face pose, shape, expression, skin reflectance and scene illumination. Due to this new way of combining CNN-based with model-based face reconstruction, the CNN-based encoder learns to extract semantically meaningful parameters from a single monocular input image. For the first time, a CNN encoder and an expert-designed generative model can be trained end-to-end in an unsupervised manner, which renders training on very large (unlabeled) real world data feasible. The obtained reconstructions compare favorably to current state-of-the-art approaches in terms of quality and richness of representation.

Details

show

hide

Language(s): eng - English

Dates: Created: 2017-03-30Published Online: 2017

Publication Status: Published online

Pages: 10 p.

Publishing info: -

Table of Contents: -

Rev. Type: -

Identifiers: arXiv: 1703.10580
URI: http://arxiv.org/abs/1703.10580
BibTex Citekey: DBLP:journals/corr/TewariZK0BPT17

Degree: -

Event

show

Legal Case

show

Project information

show

Source

show