PIE: Portrait Image Embedding for Semantic Control

Tewari, Ayush; Elgharib, Mohamed; Mallikarjun B R,; Bernard, Florian; Seidel, Hans-Peter; Pérez, Patrick; Zollhöfer, Michael; Theobalt, Christian

Local TagsRelease HistoryDetailsSummary

PIE: Portrait Image Embedding for Semantic Control

Tewari, A., Elgharib, M., Mallikarjun B R, Bernard, F., Seidel, H.-P., Pérez, P., et al. (2020). PIE: Portrait Image Embedding for Semantic Control. Retrieved from https://arxiv.org/abs/2009.09485.

Item is Released

show all hide all

Basic

show hide

Item Permalink: https://hdl.handle.net/21.11116/0000-0007-B117-7 Version Permalink: https://hdl.handle.net/21.11116/0000-000E-31DE-1

Genre: Paper

Latex : {PIE}: {P}ortrait Image Embedding for Semantic Control

Files

show Files

hide Files

:

arXiv:2009.09485.pdf (Preprint), 12MB

View Save

File Permalink:
https://hdl.handle.net/21.11116/0000-0007-B119-5

Name:
arXiv:2009.09485.pdf

Description:
File downloaded from arXiv at 2021-01-15 09:25 To appear in SIGGRAPH Asia 2020. Project webpage: https://gvv.mpi-inf.mpg.de/projects/PIE/

OA-Status:

Visibility:
Public

MIME-Type / Checksum:
application/pdf / [MD5]

Technical Metadata:

View

Copyright Date:
-

Copyright Info:
-

License:
http://arxiv.org/licenses/nonexclusive-distrib/1.0/

Locators

show

Creators

show

hide

Creators:
Tewari, Ayush¹, Author
Elgharib, Mohamed¹, Author
Mallikarjun B R¹, Author
Bernard, Florian¹, Author
Seidel, Hans-Peter¹, Author
Pérez, Patrick², Author
Zollhöfer, Michael¹, Author
Theobalt, Christian¹, Author

Affiliations:
1Computer Graphics, MPI for Informatics, Max Planck Society, ou_40047
2External Organizations, ou_persistent22

Content

show

hide

Free keywords: Computer Science, Computer Vision and Pattern Recognition, cs.CV,Computer Science, Graphics, cs.GR

Abstract: Editing of portrait images is a very popular and important research topic
with a large variety of applications. For ease of use, control should be
provided via a semantically meaningful parameterization that is akin to
computer animation controls. The vast majority of existing techniques do not
provide such intuitive and fine-grained control, or only enable coarse editing
of a single isolated control parameter. Very recently, high-quality
semantically controlled editing has been demonstrated, however only on
synthetically created StyleGAN images. We present the first approach for
embedding real portrait images in the latent space of StyleGAN, which allows
for intuitive editing of the head pose, facial expression, and scene
illumination in the image. Semantic editing in parameter space is achieved
based on StyleRig, a pretrained neural network that maps the control space of a
3D morphable face model to the latent space of the GAN. We design a novel
hierarchical non-linear optimization problem to obtain the embedding. An
identity preservation energy term allows spatially coherent edits while
maintaining facial integrity. Our approach runs at interactive frame rates and
thus allows the user to explore the space of possible edits. We evaluate our
approach on a wide set of portrait photos, compare it to the current state of
the art, and validate the effectiveness of its components in an ablation study.

Details

show

hide

Language(s): eng - English

Dates: Created: 2020-09-20Published Online: 2020

Publication Status: Published online

Pages: 14 p.

Publishing info: -

Table of Contents: -

Rev. Type: -

Identifiers: arXiv: 2009.09485
URI: https://arxiv.org/abs/2009.09485
BibTex Citekey: Tewari_2009.09485

Degree: -

Event

show

Legal Case

show

Project information

show

Source

show