Title: LetsMap: Unsupervised Representation Learning for Semantic BEV Mapping

May 29, 2024

Nikhil Gosala, Kürsat Petek, B Ravi Kiran, Senthil Yogamani, Paulo Drews-Jr, Wolfram Burgard, Abhinav Valada

Abstract: Semantic Bird's Eye View (BEV) maps offer a rich representation with strong occlusion reasoning for various decision-making tasks in autonomous driving. However, most BEV mapping approaches employ a fully supervised learning paradigm that relies on large amounts of human-annotated BEV ground truth data. In this work, we address this limitation by proposing the first unsupervised representation learning approach to generate semantic BEV maps from a monocular frontal view (FV) image in a label-efficient manner. Our approach pretrains the network to independently reason about scene geometry and scene semantics using two disjoint neural pathways in an unsupervised manner and then finetunes it for the task of semantic BEV mapping using only a small fraction of labels in the BEV. We achieve label-free pretraining by exploiting spatial and temporal consistency of FV images to learn scene geometry while relying on a novel temporal masked autoencoder formulation to encode the scene representation. Extensive evaluations on the KITTI-360 and nuScenes datasets demonstrate that our approach performs on par with the existing state-of-the-art approaches while using only 1% of BEV labels and no additional labeled data.

* 23 pages, 5 figures
