Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Pix2Map: Cross-modal Retrieval for Inferring Street Maps from Images

Jan 10, 2023

Xindi Wu, KwunFung Lau, Francesco Ferroni, Aljoša Ošep, Deva Ramanan

Figure 1 for Pix2Map: Cross-modal Retrieval for Inferring Street Maps from Images

Figure 2 for Pix2Map: Cross-modal Retrieval for Inferring Street Maps from Images

Figure 3 for Pix2Map: Cross-modal Retrieval for Inferring Street Maps from Images

Figure 4 for Pix2Map: Cross-modal Retrieval for Inferring Street Maps from Images

Share this with someone who'll enjoy it:

Abstract:Self-driving vehicles rely on urban street maps for autonomous navigation. In this paper, we introduce Pix2Map, a method for inferring urban street map topology directly from ego-view images, as needed to continually update and expand existing maps. This is a challenging task, as we need to infer a complex urban road topology directly from raw image data. The main insight of this paper is that this problem can be posed as cross-modal retrieval by learning a joint, cross-modal embedding space for images and existing maps, represented as discrete graphs that encode the topological layout of the visual surroundings. We conduct our experimental evaluation using the Argoverse dataset and show that it is indeed possible to accurately retrieve street maps corresponding to both seen and unseen roads solely from image data. Moreover, we show that our retrieved maps can be used to update or expand existing maps and even show proof-of-concept results for visual localization and image retrieval from spatial graphs.

* 12 pages, 8 figures

View paper on

Share this with someone who'll enjoy it:

Title:Pix2Map: Cross-modal Retrieval for Inferring Street Maps from Images

Paper and Code