  Towards matching peripheral appearance for arbitrary natural images using deep features

Wallis, T., Funke, C., Ecker, A., Gatys, L., Wichmann, F., & Bethge, M. (2017). Towards matching peripheral appearance for arbitrary natural images using deep features. Poster presented at 17th Annual Meeting of the Vision Sciences Society (VSS 2017), St. Pete Beach, FL, USA.

External References

External reference: Link (any full text)
Description: -
OA status:

Creators

Creators:
Wallis, T., Author
Funke, C., Author
Ecker, A., Author
Gatys, L., Author
Wichmann, F., Author
Bethge, M.1, Author
Affiliations:
1 External Organizations, ou_persistent22

Content

Keywords: -
Abstract: Due to the structure of the primate visual system, large distortions of the input can go unnoticed in the periphery, and objects can be harder to identify. What encoding underlies these effects? Similarly to Freeman & Simoncelli (Nature Neuroscience, 2011), we developed a model that uses summary statistics averaged over spatial regions whose size increases with retinal eccentricity (assuming central fixation on an image). We also designed the averaging areas such that changing their scaling progressively discards more information from the original image (i.e., a coarser model produces greater distortions of original image structure than a model with higher resolution). Unlike Freeman and Simoncelli, we use the features of a deep neural network trained on object recognition (the VGG-19; Simonyan & Zisserman, ICLR 2015), which is state-of-the-art in parametric texture synthesis. We tested whether human observers can discriminate model-generated images from their original source images. Three images subtending 25 deg, two of which were physically identical, were presented for 200 ms each in a three-alternative temporal oddity paradigm. We find a model that, for most original images we tested, produces synthesised images that cannot be told apart from the originals despite producing significant distortions of image structure. However, some images were readily discriminable. Therefore, the model has successfully encoded necessary but not sufficient information to capture appearance in human scene perception. We explore which image features are correlated with discriminability at the image level (which images are harder than others?) and at the pixel level (where in an image is the hardest location?). While our model does not produce "metamers", it does capture many features important for the appearance of arbitrary natural images in the periphery.
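The eccentricity-dependent pooling described in the abstract can be sketched as follows. This is a minimal illustration only, not the authors' model: it pools raw pixel values within concentric annuli rather than VGG-19 feature maps, and the `scaling` factor and `px_per_deg` conversion are hypothetical placeholders.

```python
import numpy as np

def pool_by_eccentricity(feature_map, fix_row, fix_col,
                         scaling=0.5, px_per_deg=10.0):
    """Average a 2-D map within concentric annuli whose width grows
    linearly with eccentricity (a crude, radial-only sketch of
    eccentricity-dependent summary-statistic pooling)."""
    h, w = feature_map.shape
    rows, cols = np.mgrid[0:h, 0:w]
    # Eccentricity of each pixel relative to the fixation point, in degrees.
    ecc = np.hypot(rows - fix_row, cols - fix_col) / px_per_deg
    pooled = feature_map.astype(float).copy()
    inner = 0.0
    while inner <= ecc.max():
        # Annulus width scales with eccentricity; a larger `scaling`
        # gives coarser pooling, hence greater distortion.
        width = max(scaling * inner, 1.0 / px_per_deg)
        mask = (ecc >= inner) & (ecc < inner + width)
        if mask.any():
            pooled[mask] = feature_map[mask].mean()
        inner += width
    return pooled
```

Averaging within each region discards within-region structure, so increasing `scaling` progressively removes more information from the original image, mirroring the coarse-versus-fine model manipulation described above.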

Details

Language(s):
Date: 2017-08
Publication status: Published
Pages: -
Place, Publisher, Edition: -
Table of contents: -
Review type: -
Identifiers: DOI: 10.1167/17.10.786
BibTeX citekey: WallisFEGWB2017
Degree type: -

Event

Title: 17th Annual Meeting of the Vision Sciences Society (VSS 2017)
Venue: St. Pete Beach, FL, USA
Start/End date: 2017-05-19 - 2017-05-24


Source 1

Title: Journal of Vision
Source genre: Journal
Creators:
Affiliations:
Place, Publisher, Edition: Charlottesville, VA : Scholar One, Inc.
Pages: -
Volume / Issue: 17 (10)
Article number: -
Start / End page: 786
Identifier: ISSN: 1534-7362
CoNE: https://pure.mpg.de/cone/journals/resource/111061245811050