Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Understanding the Limitations of CNN-based Absolute Camera Pose Regression

Mar 18, 2019

Torsten Sattler, Qunjie Zhou, Marc Pollefeys, Laura Leal-Taixe

Figure 1 for Understanding the Limitations of CNN-based Absolute Camera Pose Regression

Figure 2 for Understanding the Limitations of CNN-based Absolute Camera Pose Regression

Figure 3 for Understanding the Limitations of CNN-based Absolute Camera Pose Regression

Figure 4 for Understanding the Limitations of CNN-based Absolute Camera Pose Regression

Share this with someone who'll enjoy it:

Abstract:Visual localization is the task of accurate camera pose estimation in a known scene. It is a key problem in computer vision and robotics, with applications including self-driving cars, Structure-from-Motion, SLAM, and Mixed Reality. Traditionally, the localization problem has been tackled using 3D geometry. Recently, end-to-end approaches based on convolutional neural networks have become popular. These methods learn to directly regress the camera pose from an input image. However, they do not achieve the same level of pose accuracy as 3D structure-based methods. To understand this behavior, we develop a theoretical model for camera pose regression. We use our model to predict failure cases for pose regression techniques and verify our predictions through experiments. We furthermore use our model to show that pose regression is more closely related to pose approximation via image retrieval than to accurate pose estimation via 3D structure. A key result is that current approaches do not consistently outperform a handcrafted image retrieval baseline. This clearly shows that additional research is needed before pose regression algorithms are ready to compete with structure-based methods.

* Initial version of a paper accepted to CVPR 2019

View paper on

Share this with someone who'll enjoy it:

Title:Understanding the Limitations of CNN-based Absolute Camera Pose Regression

Paper and Code