A Pooling Approach to Modelling Spatial Relations for Image Retrieval and Annotation

Malinowski, Mateusz; Fritz, Mario

Report

MPS-Authors

Malinowski, Mateusz
Computer Vision and Multimodal Computing, MPI for Informatics, Max Planck Society

Fritz, Mario
Computer Vision and Multimodal Computing, MPI for Informatics, Max Planck Society

Fulltext (public)

arXiv:1411.5190.pdf
(Preprint), 2MB

Citation

Malinowski, M., & Fritz, M. (2014). A Pooling Approach to Modelling Spatial Relations for Image Retrieval and Annotation. Retrieved from http://arxiv.org/abs/1411.5190.

Cite as: https://hdl.handle.net/11858/00-001M-0000-0024-4D38-0

Abstract

Over the last two decades we have witnessed strong progress on modeling visual object classes, scenes and attributes that has significantly contributed to automated image understanding. On the other hand, surprisingly little progress has been made on incorporating spatial representation and reasoning into the inference process. In this work, we propose a pooling interpretation of spatial relations and show how it improves image retrieval and annotation tasks involving spatial language. Due to the complexity of spatial language, we argue for a learning-based approach that acquires a representation of spatial relations by learning the parameters of the pooling operator. We show improvements over previous work on two datasets and two different tasks, and provide additional insights on a new dataset with an explicit focus on spatial relations.