Unsupervised Learning of Shape and Pose with Differentiable Point Clouds

Insafutdinov, Eldar; Dosovitskiy, Alexey

Item

ITEM ACTIONSEXPORT

DownloadE-Mail

Please note that a newer version of this item is available:
https://pure.mpg.de/pubman/item/item_3006654_3

DetailsSummary

Unsupervised Learning of Shape and Pose with Differentiable Point Clouds

Insafutdinov, E., & Dosovitskiy, A. (2018). Unsupervised Learning of Shape and Pose with Differentiable Point Clouds. Retrieved from http://arxiv.org/abs/1810.09381.

Item is Released

show all hide all

Basic

show hide

Item Permalink: https://hdl.handle.net/21.11116/0000-0002-6A15-0 Version Permalink: https://hdl.handle.net/21.11116/0000-0002-6A16-F

Genre: Paper

Files

show Files

hide Files

arXiv:1810.09381.pdf (Preprint), 10MB

View Save

File Permalink:
https://hdl.handle.net/21.11116/0000-0002-6A17-E

Name:
arXiv:1810.09381.pdf

Description:
File downloaded from arXiv at 2018-10-31 08:49

OA-Status:

Visibility:
Public

MIME-Type / Checksum:
application/pdf / [MD5]

Technical Metadata:

View

Copyright Date:
-

Copyright Info:
-

License:
http://arxiv.org/licenses/nonexclusive-distrib/1.0/

Locators

show

Creators

show

hide

Creators:
Insafutdinov, Eldar¹, Author
Dosovitskiy, Alexey², Author

Affiliations:
1Computer Vision and Multimodal Computing, MPI for Informatics, Max Planck Society, ou_1116547
2External Organizations, ou_persistent22

Content

show

hide

Free keywords: Computer Science, Computer Vision and Pattern Recognition, cs.CV,Computer Science, Learning, cs.LG

Abstract: We address the problem of learning accurate 3D shape and camera pose from a
collection of unlabeled category-specific images. We train a convolutional
network to predict both the shape and the pose from a single image by
minimizing the reprojection error: given several views of an object, the
projections of the predicted shapes to the predicted camera poses should match
the provided views. To deal with pose ambiguity, we introduce an ensemble of
pose predictors which we then distill to a single "student" model. To allow for
efficient learning of high-fidelity shapes, we represent the shapes by point
clouds and devise a formulation allowing for differentiable projection of
these. Our experiments show that the distilled ensemble of pose predictors
learns to estimate the pose accurately, while the point cloud representation
allows to predict detailed shape models. The supplementary video can be found
at https://www.youtube.com/watch?v=LuIGovKeo60

Details

show

hide

Language(s): eng - English

Dates: Created: 2018-10-22Published Online: 2018

Publication Status: Published online

Pages: 16 p.

Publishing info: -

Table of Contents: -

Rev. Type: -

Identifiers: arXiv: 1810.09381
URI: http://arxiv.org/abs/1810.09381

Degree: -

Event

show

Legal Case

show

Project information

show

Source

show