English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  StyleGaussian: Instant 3D Style Transfer with Gaussian Splatting

Liu, K., Zhan, F., Xu, M., Theobalt, C., Shao, L., & Lu, S. (2024). StyleGaussian: Instant 3D Style Transfer with Gaussian Splatting. Retrieved from https://arxiv.org/abs/2403.07807.

Item is

Files

show Files
hide Files
:
arXiv:2403.07807.pdf (Preprint), 8MB
Name:
arXiv:2403.07807.pdf
Description:
File downloaded from arXiv at 2024-10-16 14:40
OA-Status:
Not specified
Visibility:
Public
MIME-Type / Checksum:
application/pdf / [MD5]
Technical Metadata:
Copyright Date:
-
Copyright Info:
-

Locators

show

Creators

show
hide
 Creators:
Liu, Kunhao1, Author
Zhan, Fangneng2, Author           
Xu, Muyu1, Author
Theobalt, Christian2, Author                 
Shao, Ling1, Author
Lu, Shijian1, Author
Affiliations:
1External Organizations, ou_persistent22              
2Visual Computing and Artificial Intelligence, MPI for Informatics, Max Planck Society, ou_3311330              

Content

show
hide
Free keywords: Computer Science, Computer Vision and Pattern Recognition, cs.CV
 Abstract: We introduce StyleGaussian, a novel 3D style transfer technique that allows
instant transfer of any image's style to a 3D scene at 10 frames per second
(fps). Leveraging 3D Gaussian Splatting (3DGS), StyleGaussian achieves style
transfer without compromising its real-time rendering ability and multi-view
consistency. It achieves instant style transfer with three steps: embedding,
transfer, and decoding. Initially, 2D VGG scene features are embedded into
reconstructed 3D Gaussians. Next, the embedded features are transformed
according to a reference style image. Finally, the transformed features are
decoded into the stylized RGB. StyleGaussian has two novel designs. The first
is an efficient feature rendering strategy that first renders low-dimensional
features and then maps them into high-dimensional features while embedding VGG
features. It cuts the memory consumption significantly and enables 3DGS to
render the high-dimensional memory-intensive features. The second is a
K-nearest-neighbor-based 3D CNN. Working as the decoder for the stylized
features, it eliminates the 2D CNN operations that compromise strict multi-view
consistency. Extensive experiments show that StyleGaussian achieves instant 3D
stylization with superior stylization quality while preserving real-time
rendering and strict multi-view consistency. Project page:
https://kunhao-liu.github.io/StyleGaussian/

Details

show
hide
Language(s): eng - English
 Dates: 2024-03-122024
 Publication Status: Published online
 Pages: 21 p.
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: arXiv: 2403.07807
URI: https://arxiv.org/abs/2403.07807
BibTex Citekey: Liu2403.07807
 Degree: -

Event

show

Legal Case

show

Project information

show

Source

show