Free keywords:
Computer Science, Computer Vision and Pattern Recognition, cs.CV
Abstract:
Photo-realistic re-rendering of a human from a single image with explicit
control over body pose, shape and appearance enables a wide range of
applications, such as human appearance transfer, virtual try-on, motion
imitation, and novel view synthesis. While significant progress has been made
in this direction using learning-based image generation tools, such as GANs,
existing approaches yield noticeable artefacts, such as blurred fine
details, unrealistic distortions of body parts and garments, and severe
texture changes. We therefore propose StylePoseGAN, a new method for
synthesising photo-realistic human images with explicit control over pose and
part-based appearance, in which we extend a non-controllable
generator to accept separate conditioning on pose and appearance. Our network
can be trained in a fully supervised way with human images to disentangle pose,
appearance and body parts, and it significantly outperforms existing single
image re-rendering methods. Our disentangled representation opens up further
applications such as garment transfer, motion transfer, virtual try-on, head
(identity) swap and appearance interpolation. StylePoseGAN achieves
state-of-the-art image generation fidelity on common perceptual metrics
compared to the current best-performing methods and is judged convincing in a
comprehensive user study.
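
The disentanglement described above — encoding pose and appearance separately
and merging them only inside the generator, so that either code can be swapped
at inference time (e.g. for garment or motion transfer) — can be illustrated
with a toy sketch. This is not the authors' implementation: all weights are
random stand-ins, and the style-modulation step only loosely mimics the
conditioning used in StyleGAN-like generators.

```python
import numpy as np

# Toy sketch of disentangled pose/appearance conditioning (hypothetical,
# not the StylePoseGAN architecture): two independent encodings feed one
# generator, so appearance can be swapped while pose is held fixed.
rng = np.random.default_rng(0)
POSE_DIM, APP_DIM, OUT_DIM = 6, 4, 5

W_pose = rng.standard_normal((POSE_DIM, OUT_DIM))  # pose pathway (random stand-in)
W_app = rng.standard_normal((APP_DIM, OUT_DIM))    # appearance pathway (random stand-in)

def generate(pose_code, app_code):
    # Pose features modulated by an appearance-derived "style" vector,
    # loosely mimicking style-based conditioning.
    pose_feat = pose_code @ W_pose
    style = np.tanh(app_code @ W_app)
    return pose_feat * (1.0 + style)

pose_a = rng.standard_normal(POSE_DIM)
app_x = rng.standard_normal(APP_DIM)
app_y = rng.standard_normal(APP_DIM)

img_ax = generate(pose_a, app_x)
img_ay = generate(pose_a, app_y)  # same pose, different appearance
print(img_ax.shape)                 # output dimensionality is unchanged
print(np.allclose(img_ax, img_ay))  # appearance swap changes the output
```

Because the two codes enter the generator through separate pathways, holding
one fixed while varying the other is exactly the mechanism that enables the
applications listed in the abstract (garment transfer, head swap,
appearance interpolation).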