Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Peter Ondruska

DriverGym: Democratising Reinforcement Learning for Autonomous Driving

Nov 12, 2021

Parth Kothari, Christian Perone, Luca Bergamini, Alexandre Alahi, Peter Ondruska

Figure 1 for DriverGym: Democratising Reinforcement Learning for Autonomous Driving

Figure 2 for DriverGym: Democratising Reinforcement Learning for Autonomous Driving

Figure 3 for DriverGym: Democratising Reinforcement Learning for Autonomous Driving

Figure 4 for DriverGym: Democratising Reinforcement Learning for Autonomous Driving

Abstract:Despite promising progress in reinforcement learning (RL), developing algorithms for autonomous driving (AD) remains challenging: one of the critical issues being the absence of an open-source platform capable of training and effectively validating the RL policies on real-world data. We propose DriverGym, an open-source OpenAI Gym-compatible environment specifically tailored for developing RL algorithms for autonomous driving. DriverGym provides access to more than 1000 hours of expert logged data and also supports reactive and data-driven agent behavior. The performance of an RL policy can be easily validated on real-world data using our extensive and flexible closed-loop evaluation protocol. In this work, we also provide behavior cloning baselines using supervised learning and RL, trained in DriverGym. We make DriverGym code, as well as all the baselines publicly available to further stimulate development from the community.

* Accepted to NeurIPS 2021 ML4AD Workshop

Via

Access Paper or Ask Questions

SafetyNet: Safe planning for real-world self-driving vehicles using machine-learned policies

Sep 28, 2021

Matt Vitelli, Yan Chang, Yawei Ye, Maciej Wołczyk, Błażej Osiński, Moritz Niendorf, Hugo Grimmett, Qiangui Huang, Ashesh Jain, Peter Ondruska

Figure 1 for SafetyNet: Safe planning for real-world self-driving vehicles using machine-learned policies

Figure 2 for SafetyNet: Safe planning for real-world self-driving vehicles using machine-learned policies

Figure 3 for SafetyNet: Safe planning for real-world self-driving vehicles using machine-learned policies

Figure 4 for SafetyNet: Safe planning for real-world self-driving vehicles using machine-learned policies

Abstract:In this paper we present the first safe system for full control of self-driving vehicles trained from human demonstrations and deployed in challenging, real-world, urban environments. Current industry-standard solutions use rule-based systems for planning. Although they perform reasonably well in common scenarios, the engineering complexity renders this approach incompatible with human-level performance. On the other hand, the performance of machine-learned (ML) planning solutions can be improved by simply adding more exemplar data. However, ML methods cannot offer safety guarantees and sometimes behave unpredictably. To combat this, our approach uses a simple yet effective rule-based fallback layer that performs sanity checks on an ML planner's decisions (e.g. avoiding collision, assuring physical feasibility). This allows us to leverage ML to handle complex situations while still assuring the safety, reducing ML planner-only collisions by 95%. We train our ML planner on 300 hours of expert driving demonstrations using imitation learning and deploy it along with the fallback layer in downtown San Francisco, where it takes complete control of a real vehicle and navigates a wide variety of challenging urban driving scenarios.

Via

Access Paper or Ask Questions

Urban Driver: Learning to Drive from Real-world Demonstrations Using Policy Gradients

Sep 27, 2021

Oliver Scheel, Luca Bergamini, Maciej Wołczyk, Błażej Osiński, Peter Ondruska

Figure 1 for Urban Driver: Learning to Drive from Real-world Demonstrations Using Policy Gradients

Figure 2 for Urban Driver: Learning to Drive from Real-world Demonstrations Using Policy Gradients

Figure 3 for Urban Driver: Learning to Drive from Real-world Demonstrations Using Policy Gradients

Figure 4 for Urban Driver: Learning to Drive from Real-world Demonstrations Using Policy Gradients

Abstract:In this work we are the first to present an offline policy gradient method for learning imitative policies for complex urban driving from a large corpus of real-world demonstrations. This is achieved by building a differentiable data-driven simulator on top of perception outputs and high-fidelity HD maps of the area. It allows us to synthesize new driving experiences from existing demonstrations using mid-level representations. Using this simulator we then train a policy network in closed-loop employing policy gradients. We train our proposed method on 100 hours of expert demonstrations on urban roads and show that it learns complex driving policies that generalize well and can perform a variety of driving maneuvers. We demonstrate this in simulation as well as deploy our model to self-driving vehicles in the real-world. Our method outperforms previously demonstrated state-of-the-art for urban driving scenarios -- all this without the need for complex state perturbations or collecting additional on-policy data during training. We make code and data publicly available.

* CoRL 2021

Via

Access Paper or Ask Questions

Autonomy 2.0: Why is self-driving always 5 years away?

Aug 09, 2021

Ashesh Jain, Luca Del Pero, Hugo Grimmett, Peter Ondruska

Figure 1 for Autonomy 2.0: Why is self-driving always 5 years away?

Figure 2 for Autonomy 2.0: Why is self-driving always 5 years away?

Figure 3 for Autonomy 2.0: Why is self-driving always 5 years away?

Figure 4 for Autonomy 2.0: Why is self-driving always 5 years away?

Abstract:Despite the numerous successes of machine learning over the past decade (image recognition, decision-making, NLP, image synthesis), self-driving technology has not yet followed the same trend. In this paper, we study the history, composition, and development bottlenecks of the modern self-driving stack. We argue that the slow progress is caused by approaches that require too much hand-engineering, an over-reliance on road testing, and high fleet deployment costs. We observe that the classical stack has several bottlenecks that preclude the necessary scale needed to capture the long tail of rare events. To resolve these problems, we outline the principles of Autonomy 2.0, an ML-first approach to self-driving, as a viable alternative to the currently adopted state-of-the-art. This approach is based on (i) a fully differentiable AV stack trainable from human demonstrations, (ii) closed-loop data-driven reactive simulation, and (iii) large-scale, low-cost data collections as critical solutions towards scalability issues. We outline the general architecture, survey promising works in this direction and propose key challenges to be addressed by the community in the future.

Via

Access Paper or Ask Questions

What data do we need for training an AV motion planner?

May 26, 2021

Long Chen, Lukas Platinsky, Stefanie Speichert, Blazej Osinski, Oliver Scheel, Yawei Ye, Hugo Grimmett, Luca del Pero, Peter Ondruska

Figure 1 for What data do we need for training an AV motion planner?

Figure 2 for What data do we need for training an AV motion planner?

Figure 3 for What data do we need for training an AV motion planner?

Figure 4 for What data do we need for training an AV motion planner?

Abstract:We investigate what grade of sensor data is required for training an imitation-learning-based AV planner on human expert demonstration. Machine-learned planners are very hungry for training data, which is usually collected using vehicles equipped with the same sensors used for autonomous operation. This is costly and non-scalable. If cheaper sensors could be used for collection instead, data availability would go up, which is crucial in a field where data volume requirements are large and availability is small. We present experiments using up to 1000 hours worth of expert demonstration and find that training with 10x lower-quality data outperforms 1x AV-grade data in terms of planner performance. The important implication of this is that cheaper sensors can indeed be used. This serves to improve data access and democratize the field of imitation-based motion planning. Alongside this, we perform a sensitivity analysis of planner performance as a function of perception range, field-of-view, accuracy, and data volume, and the reason why lower-quality data still provide good planning results.

* Published at 2021 International Conference on Robotics and Automation (ICRA2021)

Via

Access Paper or Ask Questions

SimNet: Learning Reactive Self-driving Simulations from Real-world Observations

May 26, 2021

Luca Bergamini, Yawei Ye, Oliver Scheel, Long Chen, Chih Hu, Luca Del Pero, Blazej Osinski, Hugo Grimmett, Peter Ondruska

Figure 1 for SimNet: Learning Reactive Self-driving Simulations from Real-world Observations

Figure 2 for SimNet: Learning Reactive Self-driving Simulations from Real-world Observations

Figure 3 for SimNet: Learning Reactive Self-driving Simulations from Real-world Observations

Figure 4 for SimNet: Learning Reactive Self-driving Simulations from Real-world Observations

Abstract:In this work, we present a simple end-to-end trainable machine learning system capable of realistically simulating driving experiences. This can be used for the verification of self-driving system performance without relying on expensive and time-consuming road testing. In particular, we frame the simulation problem as a Markov Process, leveraging deep neural networks to model both state distribution and transition function. These are trainable directly from the existing raw observations without the need for any handcrafting in the form of plant or kinematic models. All that is needed is a dataset of historical traffic episodes. Our formulation allows the system to construct never seen scenes that unfold realistically reacting to the self-driving car's behaviour. We train our system directly from 1,000 hours of driving logs and measure both realism, reactivity of the simulation as the two key properties of the simulation. At the same time, we apply the method to evaluate the performance of a recently proposed state-of-the-art ML planning system trained from human driving logs. We discover this planning system is prone to previously unreported causal confusion issues that are difficult to test by non-reactive simulation. To the best of our knowledge, this is the first work that directly merges highly realistic data-driven simulations with a closed-loop evaluation for self-driving vehicles. We make the data, code, and pre-trained models publicly available to further stimulate simulation development.

* Published at 2021 International Conference on Robotics and Automation (ICRA2021)

Via

Access Paper or Ask Questions

Collaborative Augmented Reality on Smartphones via Life-long City-scale Maps

Nov 10, 2020

Lukas Platinsky, Michal Szabados, Filip Hlasek, Ross Hemsley, Luca Del Pero, Andrej Pancik, Bryan Baum, Hugo Grimmett, Peter Ondruska

Figure 1 for Collaborative Augmented Reality on Smartphones via Life-long City-scale Maps

Figure 2 for Collaborative Augmented Reality on Smartphones via Life-long City-scale Maps

Figure 3 for Collaborative Augmented Reality on Smartphones via Life-long City-scale Maps

Figure 4 for Collaborative Augmented Reality on Smartphones via Life-long City-scale Maps

Abstract:In this paper we present the first published end-to-end production computer-vision system for powering city-scale shared augmented reality experiences on mobile devices. In doing so we propose a new formulation for an experience-based mapping framework as an effective solution to the key issues of city-scale SLAM scalability, robustness, map updates and all-time all-weather performance required by a production system. Furthermore, we propose an effective way of synchronising SLAM systems to deliver seamless real-time localisation of multiple edge devices at the same time. All this in the presence of network latency and bandwidth limitations. The resulting system is deployed and tested at scale in San Francisco where it delivers AR experiences in a mapped area of several hundred kilometers. To foster further development of this area we offer the data set to the public, constituting the largest of this kind to date.

* Published at ISMAR 2020, http://www.bluevisionlabs.org

Via

Access Paper or Ask Questions

One Thousand and One Hours: Self-driving Motion Prediction Dataset

Jun 25, 2020

John Houston, Guido Zuidhof, Luca Bergamini, Yawei Ye, Ashesh Jain, Sammy Omari, Vladimir Iglovikov, Peter Ondruska

Figure 1 for One Thousand and One Hours: Self-driving Motion Prediction Dataset

Figure 2 for One Thousand and One Hours: Self-driving Motion Prediction Dataset

Figure 3 for One Thousand and One Hours: Self-driving Motion Prediction Dataset

Figure 4 for One Thousand and One Hours: Self-driving Motion Prediction Dataset

Abstract:We present the largest self-driving dataset for motion prediction to date, with over 1,000 hours of data. This was collected by a fleet of 20 autonomous vehicles along a fixed route in Palo Alto, California over a four-month period. It consists of 170,000 scenes, where each scene is 25 seconds long and captures the perception output of the self-driving system, which encodes the precise positions and motions of nearby vehicles, cyclists, and pedestrians over time. On top of this, the dataset contains a high-definition semantic map with 15,242 labelled elements and a high-definition aerial view over the area. Together with the provided software kit, this collection forms the largest, most complete and detailed dataset to date for the development of self-driving, machine learning tasks such as motion forecasting, planning and simulation. The full dataset is available at http://level5.lyft.com/.

* The full dataset is available at http://level5.lyft.com/

Via

Access Paper or Ask Questions

VALUE: Large Scale Voting-based Automatic Labelling for Urban Environments

Jun 05, 2020

Giacomo Dabisias, Emanuele Ruffaldi, Hugo Grimmett, Peter Ondruska

Figure 1 for VALUE: Large Scale Voting-based Automatic Labelling for Urban Environments

Figure 2 for VALUE: Large Scale Voting-based Automatic Labelling for Urban Environments

Figure 3 for VALUE: Large Scale Voting-based Automatic Labelling for Urban Environments

Figure 4 for VALUE: Large Scale Voting-based Automatic Labelling for Urban Environments

Abstract:This paper presents a simple and robust method for the automatic localisation of static 3D objects in large-scale urban environments. By exploiting the potential to merge a large volume of noisy but accurately localised 2D image data, we achieve superior performance in terms of both robustness and accuracy of the recovered 3D information. The method is based on a simple distributed voting schema which can be fully distributed and parallelised to scale to large-scale scenarios. To evaluate the method we collected city-scale data sets from New York City and San Francisco consisting of almost 400k images spanning the area of 40 km$^2$ and used it to accurately recover the 3D positions of traffic lights. We demonstrate a robust performance and also show that the solution improves in quality over time as the amount of data increases.

* Presented at ICRA-2018 conference, 20-25th May 2018, Brisbane, Australia

Via

Access Paper or Ask Questions

Deep Tracking on the Move: Learning to Track the World from a Moving Vehicle using Recurrent Neural Networks

Apr 19, 2017

Julie Dequaire, Dushyant Rao, Peter Ondruska, Dominic Wang, Ingmar Posner

Figure 1 for Deep Tracking on the Move: Learning to Track the World from a Moving Vehicle using Recurrent Neural Networks

Figure 2 for Deep Tracking on the Move: Learning to Track the World from a Moving Vehicle using Recurrent Neural Networks

Figure 3 for Deep Tracking on the Move: Learning to Track the World from a Moving Vehicle using Recurrent Neural Networks

Figure 4 for Deep Tracking on the Move: Learning to Track the World from a Moving Vehicle using Recurrent Neural Networks

Abstract:This paper presents an end-to-end approach for tracking static and dynamic objects for an autonomous vehicle driving through crowded urban environments. Unlike traditional approaches to tracking, this method is learned end-to-end, and is able to directly predict a full unoccluded occupancy grid map from raw laser input data. Inspired by the recently presented DeepTracking approach [Ondruska, 2016], we employ a recurrent neural network (RNN) to capture the temporal evolution of the state of the environment, and propose to use Spatial Transformer modules to exploit estimates of the egomotion of the vehicle. Our results demonstrate the ability to track a range of objects, including cars, buses, pedestrians, and cyclists through occlusion, from both moving and stationary platforms, using a single learned model. Experimental results demonstrate that the model can also predict the future states of objects from current inputs, with greater accuracy than previous work.

Via

Access Paper or Ask Questions