Not Using the Car to See the Sidewalk: Quantifying and Controlling the Effects 
of Context in Classification and Segmentation

Shetty, Rakshith; Schiele, Bernt; Fritz, Mario

Shetty, R., Schiele, B., & Fritz, M. (2018). Not Using the Car to See the Sidewalk: Quantifying and Controlling the Effects of Context in Classification and Segmentation. Retrieved from http://arxiv.org/abs/1812.06707.

Item is Released

Basic

Item Permalink: https://hdl.handle.net/21.11116/0000-0002-B487-A Version Permalink: https://hdl.handle.net/21.11116/0000-0002-B488-9

Genre: Paper

Latex : Not Using the Car to See the Sidewalk: {Q}uantifying and Controlling the Effects of Context in Classification and Segmentation

Files

arXiv:1812.06707.pdf (Preprint), 9MB

File Permalink:
-

Name:
arXiv:1812.06707.pdf

Description:
File downloaded from arXiv at 2018-12-20 09:35

OA-Status:

Visibility:
Private

MIME-Type / Checksum:
application/pdf

Technical Metadata:

Copyright Date:
-

Copyright Info:
-

License:
http://arxiv.org/licenses/nonexclusive-distrib/1.0/

Creators

Creators:
Shetty, Rakshith¹, Author
Schiele, Bernt¹, Author
Fritz, Mario², Author

Affiliations:
¹ Computer Vision and Multimodal Computing, MPI for Informatics, Max Planck Society, ou_1116547
² External Organizations, ou_persistent22

Content

Free keywords: Computer Science, Computer Vision and Pattern Recognition, cs.CV, Computer Science, Artificial Intelligence, cs.AI, Statistics, Machine Learning, stat.ML

Abstract: The importance of visual context in scene understanding tasks is well
recognized in the computer vision community. However, it is unclear to what
extent computer vision models for image classification and semantic
segmentation depend on context to make their predictions. A model that relies
too heavily on context will fail when it encounters objects in context
distributions different from the training data, so it is important to identify
these dependencies before deploying such models in the real world. We propose a
method to quantify the sensitivity of black-box vision models to visual context
by editing images to remove selected objects and measuring the response of the
target models. We apply this methodology to two tasks, image classification and
semantic segmentation, and discover undesirable dependencies between objects
and context, for example that "sidewalk" segmentation relies heavily on "cars"
being present in the image. We propose an object-removal-based data
augmentation to mitigate this dependency and increase the robustness of
classification and segmentation models to contextual variations. Our
experiments show that the proposed data augmentation improves the performance
of these models in out-of-context scenarios while preserving their performance
on regular data.
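
The measurement idea in the abstract (remove an object, then compare the black-box model's response before and after) can be illustrated with a minimal sketch. This is not the authors' implementation: the mean-fill `remove_object` is a crude stand-in for a real image editor, and `toy_sidewalk_score`, `context_sensitivity`, and the tiny 2x3 "image" are hypothetical names and data chosen only for illustration.

```python
from typing import Callable, Dict, List

Image = List[List[float]]  # toy grayscale image: rows of pixel values
Mask = List[List[bool]]    # True where the object to remove is located


def remove_object(image: Image, mask: Mask) -> Image:
    """Assumed stand-in for an image editor: fill masked pixels with the
    mean of the unmasked pixels. A real study would use an inpainting model."""
    kept = [p for row, mrow in zip(image, mask)
            for p, m in zip(row, mrow) if not m]
    fill = sum(kept) / len(kept) if kept else 0.0
    return [[fill if m else p for p, m in zip(row, mrow)]
            for row, mrow in zip(image, mask)]


def context_sensitivity(model: Callable[[Image], float],
                        image: Image,
                        object_masks: Dict[str, Mask]) -> Dict[str, float]:
    """Drop in the model's score when each context object is removed.
    The model is only called, never inspected (black-box access)."""
    base = model(image)
    return {name: base - model(remove_object(image, mask))
            for name, mask in object_masks.items()}


def toy_sidewalk_score(image: Image) -> float:
    """Hypothetical black-box model: its 'sidewalk' score is the mean
    brightness of the last column, which is where the toy 'car' sits,
    so it depends on context by construction."""
    return sum(row[-1] for row in image) / len(image)


image = [[0.0, 0.0, 1.0],
         [0.0, 0.0, 1.0]]
car_mask = [[False, False, True],
            [False, False, True]]
tree_mask = [[True, False, False],
             [False, False, False]]

drops = context_sensitivity(toy_sidewalk_score, image,
                            {"car": car_mask, "tree": tree_mask})
print(drops)
```

In this toy setup, removing the "car" region changes the score while removing the "tree" region does not, which is the kind of object-to-context dependency the abstract describes. The same `remove_object` step could, under the same assumptions, be applied to training images as a rough analogue of the object-removal-based augmentation.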

Details

Language(s): eng - English

Dates: Created: 2018-12-17; Published Online: 2018

Publication Status: Published online

Pages: 14 p.

Publishing info: -

Table of Contents: -

Rev. Type: -

Identifiers: arXiv: 1812.06707
URI: http://arxiv.org/abs/1812.06707
BibTex Citekey: shetty2018context

Degree: -
