Teaching 3D Geometry to Deformable Part Models

Pepik, Bojan; Stark, Michael; Gehler, Peter; Schiele, Bernt

doi:10.1109/CVPR.2012.6248075

Conference Paper

MPS-Authors

Pepik, Bojan
Computer Vision and Multimodal Computing, MPI for Informatics, Max Planck Society

Stark, Michael
Computer Vision and Multimodal Computing, MPI for Informatics, Max Planck Society

Gehler, Peter
Dept. Perceiving Systems, Max Planck Institute for Intelligent Systems, Max Planck Society

Schiele, Bernt
Computer Vision and Multimodal Computing, MPI for Informatics, Max Planck Society

Citation

Pepik, B., Stark, M., Gehler, P., & Schiele, B. (2012). Teaching 3D Geometry to Deformable Part Models. In 2012 IEEE Conference on Computer Vision and Pattern Recognition (pp. 3362-3369). Piscataway, NJ: IEEE. doi:10.1109/CVPR.2012.6248075.

Cite as: https://hdl.handle.net/11858/00-001M-0000-000E-F5AF-1

Abstract

Current object class recognition systems typically target 2D bounding box localization, encouraged by benchmark data sets such as Pascal VOC. While this seems suitable for the detection of individual objects, higher-level applications such as 3D scene understanding or 3D object tracking would benefit from more fine-grained object hypotheses incorporating 3D geometric information, such as viewpoints or the locations of individual parts. In this paper, we help narrow the representational gap between the ideal input of a scene understanding system and the output of an object class detector by designing a detector specifically tailored towards 3D geometric reasoning. In particular, we extend the successful discriminatively trained deformable part models to include both estimates of viewpoint and 3D parts that are consistent across viewpoints. We experimentally verify that adding 3D geometric information comes at minimal performance loss w.r.t. 2D bounding box localization, while the resulting detector outperforms prior work in 3D viewpoint estimation and ultra-wide baseline matching.