Title: SelfD: Self-Learning Large-Scale Driving Policies From the Web

Apr 21, 2022

Jimuyang Zhang, Ruizhao Zhu, Eshed Ohn-Bar

Abstract: Effectively utilizing the vast amounts of ego-centric navigation data that is freely available on the internet can advance generalized intelligent systems, i.e., to robustly scale across perspectives, platforms, environmental conditions, scenarios, and geographical locations. However, it is difficult to directly leverage such large amounts of unlabeled and highly diverse data for complex 3D reasoning and planning tasks. Consequently, researchers have primarily focused on its use for various auxiliary pixel- and image-level computer vision tasks that do not consider an ultimate navigational objective. In this work, we introduce SelfD, a framework for learning scalable driving by utilizing large amounts of online monocular images. Our key idea is to leverage iterative semi-supervised training when learning imitative agents from unlabeled data. To handle unconstrained viewpoints, scenes, and camera parameters, we train an image-based model that directly learns to plan in the Bird's Eye View (BEV) space. Next, we use unlabeled data to augment the decision-making knowledge and robustness of an initially trained model via self-training. In particular, we propose a pseudo-labeling step which enables making full use of highly diverse demonstration data through "hypothetical" planning-based data augmentation. We employ a large dataset of publicly available YouTube videos to train SelfD and comprehensively analyze its generalization benefits across challenging navigation scenarios. Without requiring any additional data collection or annotation efforts, SelfD demonstrates consistent improvements (by up to 24%) in driving performance evaluation on nuScenes, Argoverse, Waymo, and CARLA.
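The self-training loop described in the abstract (train an initial model, pseudo-label unlabeled frames, retrain) can be sketched roughly as below. This is a toy illustration only, not the authors' implementation: the scalar "frame" feature, the scalar "waypoint", the `COMMANDS` set, the per-command line fit, and every function name are assumptions made up for this sketch. Labeling each unlabeled frame once per command is one possible reading of the "hypothetical" planning-based augmentation, which the abstract does not spell out.

```python
from typing import Dict, List, Tuple

# Toy stand-ins (all hypothetical): a "frame" is one scalar feature, a "plan"
# is one scalar waypoint offset, and a driving command is a string.
Sample = Tuple[float, str, float]        # (feature, command, waypoint)
Model = Dict[str, Tuple[float, float]]   # command -> (slope, intercept)
COMMANDS = ("left", "straight", "right")


def fit(samples: List[Sample]) -> Model:
    """Fit one least-squares line per command that has training samples."""
    model: Model = {}
    for cmd in COMMANDS:
        pts = [(x, y) for x, c, y in samples if c == cmd]
        if not pts:
            continue
        n = len(pts)
        mean_x = sum(x for x, _ in pts) / n
        mean_y = sum(y for _, y in pts) / n
        var_x = sum((x - mean_x) ** 2 for x, _ in pts)
        cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in pts)
        slope = cov_xy / var_x if var_x > 0 else 0.0
        model[cmd] = (slope, mean_y - slope * mean_x)
    return model


def predict(model: Model, feature: float, cmd: str) -> float:
    """Predict the toy waypoint for one frame under one command."""
    slope, intercept = model[cmd]
    return slope * feature + intercept


def pseudo_label(model: Model, frames: List[float]) -> List[Sample]:
    """Label every unlabeled frame once per command the model knows."""
    return [(x, cmd, predict(model, x, cmd)) for x in frames for cmd in model]


def self_train(labeled: List[Sample], unlabeled: List[float],
               rounds: int = 2) -> Model:
    """Initial supervised fit, then repeated pseudo-label-and-refit rounds."""
    model = fit(labeled)
    for _ in range(rounds):
        model = fit(labeled + pseudo_label(model, unlabeled))
    return model


if __name__ == "__main__":
    labeled = [
        (0.0, "left", -1.0), (1.0, "left", -2.0),
        (0.0, "straight", 0.0), (1.0, "straight", 0.0),
        (0.0, "right", 1.0), (1.0, "right", 2.0),
    ]
    unlabeled = [0.5, 2.0, 3.0]
    model = self_train(labeled, unlabeled)
    print(len(pseudo_label(model, unlabeled)), "pseudo-labeled samples")
    print(predict(model, 2.0, "right"))
```

With a plain least-squares model like this one, the pseudo-labels lie exactly on the fitted lines, so refitting leaves the model essentially unchanged; the sketch shows only the control flow and how one unlabeled frame expands into several training samples.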

* Accepted by CVPR 2022
