Shape-Centered Representations: From Features to Applications

Engel, D

Local TagsRelease HistoryDetailsSummary

Shape-Centered Representations: From Features to Applications

Engel, D. (2011). Shape-Centered Representations: From Features to Applications. Berlin, Germany: Logos Verlag.

Item is Released

show all hide all

Basic

show hide

Item Permalink: https://hdl.handle.net/11858/00-001M-0000-0013-BCFA-1 Version Permalink: https://hdl.handle.net/21.11116/0000-0003-1C4B-B

Genre: Book

Files

show Files

Locators

show

hide

Locator:
http://www.logos-verlag.de/cgi-bin/engbuchmid?isbn=2820lng=deuid= (Abstract) Open Access status unknown

Description:
-

OA-Status:

Creators

show

hide

Creators:
Engel, D^{1, 2, 3}, Author

Affiliations:
1Department Human Perception, Cognition and Action, Max Planck Institute for Biological Cybernetics, Max Planck Society, ou_1497797
2Max Planck Institute for Biological Cybernetics, Max Planck Society, Spemannstrasse 38, 72076 Tübingen, DE, ou_1497794
3Project group: Cognitive Engineering, Max Planck Institute for Biological Cybernetics, Max Planck Society, Spemannstrasse 38, 72076 Tübingen, DE, ou_2528702

Content

show

hide

Free keywords: -

Abstract: Computer vision aims to teach machines and algorithms to 'see' with the ultimate goal of creating 'intelligent' applications and devices that can provide assistance to humans in a wide array of scenarios. This thesis presents an investigation of computer vision on three layers: low-level features, mid-level representations and high-level applications. Each of the layers depends on the previous ones while also generating constraints and requirements for them. At the application layer human-machine interfaces come into play and link the human perception to computer vision. By studying all layers we can gain a much deeper insight into the interplay of different methods, than by examining an isolated problem. Furthermore, we are able to factor constraints imposed by different layers and the users into the design of the algorithms, instead of optimizing a single method based purely on algorithmic performance measures. After a brief introduction in Chapter 1, Chapter 2 addresses the feature layer and describes our novel shape-centered interest points that play a vital role throughout this thesis. These interest points are formed at location of high local symmetry as opposed to corner interest points which occur along the outline of shapes.
Experiments show that they are very robust with respect to common natural image transformations, such as scaling, rotation and the introduction of noise and clutter. Based on these features Chapter 3 presents two strategies to build robust mid-level image representations. First, a novel feature grouping method is introduced. The scheme offers a powerful way to combine the advantages of shape-centered interest points, namely robustness and a tight connection to a unique shape, and corner-based interest points, namely strong descriptors. Furthermore, Chapter 3 introduces a novel set of medial feature superpixels, that represent a feed-forward way to divide the image into small, visually-homogeneous regions offering a compact and efficient mid-level representation of the image information. Finally, Chapter 4 bridges the gap between computer vision and the human observer by introducing three applications that employ the shape-centered representations from the two previous Chapters. First a multi-class scene labeling scheme is presented that produces dense annotations of images, combining a local prediction step with a global optimization scheme. Then, Section 4.2 introduces a novel image retrieval tool that operates on highlevel semantic information. Such semantic annotations could be generated by automatic annotation schemes as the one described in the previous Section. Finally, the novel idea of predicting the detectability of a pedestrian in a driver assistance context is put forward and investigated. The different modules of this thesis are tightly connected and inter-dependent, in the framework of shape-centered representations. The connections between the modules avails the possibility to feed information back from higher to lower layers and optimize the design choices there. This thesis provides a framework looking at static phenomena but the presented approach could be extended to the analysis of dynamic scenes as well.

Details

show

hide

Language(s):

Dates: Date issued: 2011

Publication Status: Issued

Pages: 173

Publishing info: Berlin, Germany : Logos Verlag

Table of Contents: -

Rev. Type: -

Identifiers: ISBN: 978-3-8325-2820-1
BibTex Citekey: Engel2011_2

Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show

hide

Title: MPI Series in Biological Cybernetics

Source Genre: Series

Creator(s):

Affiliations:

Publ. Info: -

Pages: - Volume / Issue: 27 Sequence Number: - Start / End Page: - Identifier: -