Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

N. Yang

Learning Monocular 3D Vehicle Detection without 3D Bounding Box Labels

Oct 07, 2020

L. Koestler, N. Yang, R. Wang, D. Cremers

Figure 1 for Learning Monocular 3D Vehicle Detection without 3D Bounding Box Labels

Figure 2 for Learning Monocular 3D Vehicle Detection without 3D Bounding Box Labels

Figure 3 for Learning Monocular 3D Vehicle Detection without 3D Bounding Box Labels

Figure 4 for Learning Monocular 3D Vehicle Detection without 3D Bounding Box Labels

Abstract:The training of deep-learning-based 3D object detectors requires large datasets with 3D bounding box labels for supervision that have to be generated by hand-labeling. We propose a network architecture and training procedure for learning monocular 3D object detection without 3D bounding box labels. By representing the objects as triangular meshes and employing differentiable shape rendering, we define loss functions based on depth maps, segmentation masks, and ego- and object-motion, which are generated by pre-trained, off-the-shelf networks. We evaluate the proposed algorithm on the real-world KITTI dataset and achieve promising performance in comparison to state-of-the-art methods requiring 3D bounding box labels for training and superior performance to conventional baseline methods.

Via

Access Paper or Ask Questions