Released

Paper

Automatic Face Reenactment

MPS-Authors

Garrido, Pablo
Computer Graphics, MPI for Informatics, Max Planck Society

Valgaerts, Levi
Computer Graphics, MPI for Informatics, Max Planck Society

Rehmsen, Ole
Computer Graphics, MPI for Informatics, Max Planck Society

Theobalt, Christian
Computer Graphics, MPI for Informatics, Max Planck Society

External Resource
No external resources are shared
Fulltext (public)

arXiv:1602.02651.pdf
(Preprint), 299KB

Supplementary Material (public)
There is no public supplementary material available
Citation

Garrido, P., Valgaerts, L., Rehmsen, O., Thormählen, T., Perez, P., & Theobalt, C. (2016). Automatic Face Reenactment. Retrieved from http://arxiv.org/abs/1602.02651.


Cite as: https://hdl.handle.net/11858/00-001M-0000-002B-9A53-8
Abstract
We propose an image-based facial reenactment system that replaces the face of an actor in an existing target video with the face of a user from a source video, while preserving the original target performance. Our system is fully automatic and does not require a database of source expressions. Instead, it is able to produce convincing reenactment results from a short source video captured with an off-the-shelf camera, such as a webcam, in which the user performs arbitrary facial gestures. Our reenactment pipeline is conceived as part image retrieval and part face transfer: the image retrieval is based on temporal clustering of target frames and a novel image matching metric that combines appearance and motion to select candidate frames from the source video, while the face transfer uses a 2D warping strategy that preserves the user's identity. Our system excels in simplicity: it does not rely on a 3D face model, is robust under head motion, and does not require the source and target performances to be similar. We show convincing reenactment results for videos that we recorded ourselves and for low-quality footage taken from the Internet.
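The abstract does not spell out the form of the matching metric, so the following minimal Python sketch is an illustration only: it shows one way an appearance term and a motion term could be combined into a single frame score and used to rank candidate source frames. Every function name, feature choice, and the weighting parameter alpha here is a hypothetical stand-in, not the authors' actual method.

    # Illustrative sketch only: the paper's exact matching metric is not given
    # in this abstract. Raw pixel differences and per-pixel motion fields are
    # used as stand-ins for whatever appearance and motion features the real
    # pipeline computes.
    import numpy as np

    def appearance_distance(src_frame: np.ndarray, tgt_frame: np.ndarray) -> float:
        """Mean absolute pixel difference as a stand-in appearance term."""
        return float(np.mean(np.abs(src_frame.astype(np.float32) -
                                    tgt_frame.astype(np.float32))))

    def motion_distance(src_flow: np.ndarray, tgt_flow: np.ndarray) -> float:
        """Mean endpoint error between per-pixel motion fields (H x W x 2)."""
        return float(np.mean(np.linalg.norm(src_flow - tgt_flow, axis=-1)))

    def combined_score(src_frame, tgt_frame, src_flow, tgt_flow, alpha=0.5):
        """Weighted sum of appearance and motion distances (lower is better).
        alpha is a hypothetical weighting parameter, not from the paper."""
        return (alpha * appearance_distance(src_frame, tgt_frame) +
                (1.0 - alpha) * motion_distance(src_flow, tgt_flow))

    def best_candidates(src_frames, src_flows, tgt_frame, tgt_flow, k=5, alpha=0.5):
        """Return indices of the k source frames that best match the target."""
        scores = [combined_score(f, tgt_frame, fl, tgt_flow, alpha)
                  for f, fl in zip(src_frames, src_flows)]
        return list(np.argsort(scores)[:k])

In the system described above, the appearance and motion terms would presumably be computed from face-region features and estimated motion rather than raw full-frame pixels, and retrieval is further guided by the temporal clustering of target frames; the sketch only conveys the idea of a combined appearance-and-motion ranking.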