Learning Foveated Reconstruction to Preserve Perceived Image Statistics

Surace, Luca; Wernikowski, Marek; Tursun, Okan Tarhan; Myszkowski, Karol; Mantiuk, Radosław; Didyk, Piotr

Local TagsRelease HistoryDetailsSummary

Learning Foveated Reconstruction to Preserve Perceived Image Statistics

Surace, L., Wernikowski, M., Tursun, O. T., Myszkowski, K., Mantiuk, R., & Didyk, P. (2021). Learning Foveated Reconstruction to Preserve Perceived Image Statistics. Retrieved from https://arxiv.org/abs/2108.03499.

Item is Released

show all hide all

Basic

show hide

Item Permalink: https://hdl.handle.net/21.11116/0000-0009-73D9-1 Version Permalink: https://hdl.handle.net/21.11116/0000-0009-73DA-0

Genre: Paper

Files

show Files

hide Files

:

arXiv:2108.03499.pdf (Preprint), 33MB

View Save

File Permalink:
https://hdl.handle.net/21.11116/0000-0009-73DB-F

Name:
arXiv:2108.03499.pdf

Description:
File downloaded from arXiv at 2021-11-08 09:16

OA-Status:

Visibility:
Public

MIME-Type / Checksum:
application/pdf / [MD5]

Technical Metadata:

View

Copyright Date:
-

Copyright Info:
-

License:
http://creativecommons.org/licenses/by/4.0/

Locators

show

Creators

show

hide

Creators:
Surace, Luca¹, Author
Wernikowski, Marek¹, Author
Tursun, Okan Tarhan¹, Author
Myszkowski, Karol², Author
Mantiuk, Radosław¹, Author
Didyk, Piotr¹, Author

Affiliations:
1External Organizations, ou_persistent22
2Computer Graphics, MPI for Informatics, Max Planck Society, ou_40047

Content

show

hide

Free keywords: Computer Science, Graphics, cs.GR,Computer Science, Computer Vision and Pattern Recognition, cs.CV

Abstract: Foveated image reconstruction recovers full image from a sparse set of
samples distributed according to the human visual system's retinal sensitivity
that rapidly drops with eccentricity. Recently, the use of Generative
Adversarial Networks was shown to be a promising solution for such a task as
they can successfully hallucinate missing image information. Like for other
supervised learning approaches, also for this one, the definition of the loss
function and training strategy heavily influences the output quality. In this
work, we pose the question of how to efficiently guide the training of foveated
reconstruction techniques such that they are fully aware of the human visual
system's capabilities and limitations, and therefore, reconstruct visually
important image features. Due to the nature of GAN-based solutions, we
concentrate on the human's sensitivity to hallucination for different input
sample densities. We present new psychophysical experiments, a dataset, and a
procedure for training foveated image reconstruction. The strategy provides
flexibility to the generator network by penalizing only perceptually important
deviations in the output. As a result, the method aims to preserve perceived
image statistics rather than natural image statistics. We evaluate our strategy
and compare it to alternative solutions using a newly trained objective metric
and user experiments.

Details

show

hide

Language(s): eng - English

Dates: Created: 2021-08-07Published Online: 2021

Publication Status: Published online

Pages: 26 p.

Publishing info: -

Table of Contents: -

Rev. Type: -

Identifiers: arXiv: 2108.03499
URI: https://arxiv.org/abs/2108.03499
BibTex Citekey: Surace2108.03499

Degree: -

Event

show

Legal Case

show

Project information

show

Source

show