English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT

Released

Paper

Edge-Preserving Noise for Diffusion Models

MPS-Authors
/persons/resource/persons305922

Vandersanden,  Jente
Computer Graphics, MPI for Informatics, Max Planck Society;

/persons/resource/persons303045

Holl,  Sascha
Computer Graphics, MPI for Informatics, Max Planck Society;

/persons/resource/persons277749

Huang,  Xingchang
Computer Graphics, MPI for Informatics, Max Planck Society;

/persons/resource/persons225790

Singh,  Gurprit
Computer Graphics, MPI for Informatics, Max Planck Society;

External Resource
No external resources are shared
Fulltext (restricted access)
There are currently no full texts shared for your IP range.
Fulltext (public)

arXiv:2410.01540.pdf
(Preprint), 44MB

Supplementary Material (public)
There is no public supplementary material available
Citation

Vandersanden, J., Holl, S., Huang, X., & Singh, G. (2024). Edge-Preserving Noise for Diffusion Models. Retrieved from https://arxiv.org/abs/2410.01540.


Cite as: https://hdl.handle.net/21.11116/0000-0010-8AF7-C
Abstract
Classical generative diffusion models learn an isotropic Gaussian denoising
process, treating all spatial regions uniformly, thus neglecting potentially
valuable structural information in the data. Inspired by the long-established
work on anisotropic diffusion in image processing, we present a novel
edge-preserving diffusion model that is a generalization of denoising diffusion
probablistic models (DDPM). In particular, we introduce an edge-aware noise
scheduler that varies between edge-preserving and isotropic Gaussian noise. We
show that our model's generative process converges faster to results that more
closely match the target distribution. We demonstrate its capability to better
learn the low-to-mid frequencies within the dataset, which plays a crucial role
in representing shapes and structural information. Our edge-preserving
diffusion process consistently outperforms state-of-the-art baselines in
unconditional image generation. It is also more robust for generative tasks
guided by a shape-based prior, such as stroke-to-image generation. We present
qualitative and quantitative results showing consistent improvements (FID
score) of up to 30% for both tasks. We provide source code and supplementary
content via the public domain edge-preserving-diffusion.mpi-inf.mpg.de .