Analyzing the Dependency of ConvNets on Spatial Information

Fan, Yue; Xian, Yongqin; Losch, Max Maria; Schiele, Bernt

Lokale TagsFreigabegeschichteDetailsÜbersicht

Analyzing the Dependency of ConvNets on Spatial Information

Fan, Y., Xian, Y., Losch, M. M., & Schiele, B. (2020). Analyzing the Dependency of ConvNets on Spatial Information. Retrieved from https://arxiv.org/abs/2002.01827.

Item is Freigegeben

einblenden: alle ausblenden: alle

Basisdaten

einblenden: ausblenden:

Datensatz-Permalink: https://hdl.handle.net/21.11116/0000-0007-80CB-3 Versions-Permalink: https://hdl.handle.net/21.11116/0000-000E-21CB-8

Genre: Forschungspapier

Alternativer Titel : Analyzing the Dependency of {ConvNets} on Spatial Information

Dateien

einblenden: Dateien

ausblenden: Dateien

:

arXiv:2002.01827.pdf (Preprint), 3MB

Öffnen Speichern

Datei-Permalink:
https://hdl.handle.net/21.11116/0000-0007-80CD-1

Name:
arXiv:2002.01827.pdf

Beschreibung:
File downloaded from arXiv at 2020-12-03 07:15

OA-Status:

Sichtbarkeit:
Öffentlich

MIME-Typ / Prüfsumme:
application/pdf / [MD5]

Technische Metadaten:

Öffnen

Copyright Datum:
-

Copyright Info:
-

Lizenz:
http://arxiv.org/licenses/nonexclusive-distrib/1.0/

Externe Referenzen

einblenden:

Urheber

einblenden:

ausblenden:

Urheber:
Fan, Yue¹, Autor
Xian, Yongqin¹, Autor
Losch, Max Maria¹, Autor
Schiele, Bernt¹, Autor

Affiliations:
1Computer Vision and Machine Learning, MPI for Informatics, Max Planck Society, ou_1116547

Inhalt

einblenden:

ausblenden:

Schlagwörter: Computer Science, Computer Vision and Pattern Recognition, cs.CV

Zusammenfassung: Intuitively, image classification should profit from using spatial
information. Recent work, however, suggests that this might be overrated in
standard CNNs. In this paper, we are pushing the envelope and aim to further
investigate the reliance on spatial information. We propose spatial shuffling
and GAP+FC to destroy spatial information during both training and testing
phases. Interestingly, we observe that spatial information can be deleted from
later layers with small performance drops, which indicates spatial information
at later layers is not necessary for good performance. For example, test
accuracy of VGG-16 only drops by 0.03% and 2.66% with spatial information
completely removed from the last 30% and 53% layers on CIFAR100, respectively.
Evaluation on several object recognition datasets (CIFAR100, Small-ImageNet,
ImageNet) with a wide range of CNN architectures (VGG16, ResNet50, ResNet152)
shows an overall consistent pattern.

Details

einblenden:

ausblenden:

Sprache(n): eng - English

Datum: Erstellt: 2020-02-05Online veröffentlicht: 2020

Publikationsstatus: Online veröffentlicht

Seiten: -

Ort, Verlag, Ausgabe: -

Inhaltsverzeichnis: -

Art der Begutachtung: -

Identifikatoren: arXiv: 2002.01827
BibTex Citekey: Fan_arXiv2002.01827
URI: https://arxiv.org/abs/2002.01827

Art des Abschluß: -

Datensatz

Basisdaten

Dateien

Externe Referenzen

Urheber

Inhalt

Details

Veranstaltung

Entscheidung

Projektinformation

Quelle