Uncertainty-Driven 6D Pose Estimation of Objects and Scenes from a Single RGB Image

Publication Type: Conference Paper
Year of Publication: 2016
Authors: Brachmann, E, Michel, F, Krull, A, Yang, MYing, Gumhold, S, Rother, C
Conference Name: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
ISBN Number: 9781467388504

In recent years, the task of estimating the 6D pose of object instances and complete scenes, i.e. camera localization, from a single input image has received considerable attention. Consumer RGB-D cameras have made this feasible, even for difficult, texture-less objects and scenes. In this work, we show that a single RGB image is sufficient to achieve visually convincing results. Our key concept is to model and exploit the uncertainty of the system at all stages of the processing pipeline. The uncertainty comes in the form of continuous distributions over 3D object coordinates and discrete distributions over object labels. We give three technical contributions. Firstly, we develop a regularized, auto-context regression framework which iteratively reduces uncertainty in object coordinate and object label predictions. Secondly, we introduce an efficient way to marginalize object coordinate distributions over depth. This is necessary to deal with missing depth information. Thirdly, we utilize the distributions over object labels to detect multiple objects simultaneously with a fixed budget of RANSAC hypotheses. We tested our system for object pose estimation and camera localization on commonly used data sets. We see a major improvement over competing systems.
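
The third contribution, sharing a fixed budget of RANSAC hypotheses among several objects, can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `allocate_hypotheses`, the input layout (one label-to-probability mapping per pixel), and the sampling scheme (pick a pixel uniformly, then draw an object label from that pixel's discrete distribution) are assumptions made only for illustration.

```python
import random
from collections import Counter


def allocate_hypotheses(pixel_label_probs, budget, seed=0):
    """Split a fixed hypothesis budget among objects (illustrative only).

    pixel_label_probs: list of dicts, one per pixel, mapping an object
        label to its predicted probability at that pixel (assumed layout).
    budget: total number of pose hypotheses to draw.
    Returns a dict mapping each drawn label to its hypothesis count.
    """
    rng = random.Random(seed)
    counts = Counter()
    for _ in range(budget):
        # Pick a pixel uniformly at random.
        probs = rng.choice(pixel_label_probs)
        labels = list(probs)
        weights = [probs[label] for label in labels]
        # Draw the object this hypothesis belongs to from the
        # pixel's discrete label distribution.
        label = rng.choices(labels, weights=weights, k=1)[0]
        counts[label] += 1
    return dict(counts)


if __name__ == "__main__":
    pixels = [
        {"duck": 0.7, "cat": 0.3},
        {"duck": 0.1, "cat": 0.9},
    ]
    print(allocate_hypotheses(pixels, budget=256))
```

Under this scheme, objects that receive more probability mass in the image attract more hypotheses, while the total number of hypotheses stays at the budget no matter how many objects are present.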

Citation Key: Brachmann2016