Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Alex Krizhevsky

ChauffeurNet: Learning to Drive by Imitating the Best and Synthesizing the Worst

Dec 07, 2018

Mayank Bansal, Alex Krizhevsky, Abhijit Ogale

Figure 1 for ChauffeurNet: Learning to Drive by Imitating the Best and Synthesizing the Worst

Figure 2 for ChauffeurNet: Learning to Drive by Imitating the Best and Synthesizing the Worst

Figure 3 for ChauffeurNet: Learning to Drive by Imitating the Best and Synthesizing the Worst

Figure 4 for ChauffeurNet: Learning to Drive by Imitating the Best and Synthesizing the Worst

Abstract:Our goal is to train a policy for autonomous driving via imitation learning that is robust enough to drive a real vehicle. We find that standard behavior cloning is insufficient for handling complex driving scenarios, even when we leverage a perception system for preprocessing the input and a controller for executing the output on the car: 30 million examples are still not enough. We propose exposing the learner to synthesized data in the form of perturbations to the expert's driving, which creates interesting situations such as collisions and/or going off the road. Rather than purely imitating all data, we augment the imitation loss with additional losses that penalize undesirable events and encourage progress -- the perturbations then provide an important signal for these losses and lead to robustness of the learned model. We show that the ChauffeurNet model can handle complex situations in simulation, and present ablation experiments that emphasize the importance of each of our proposed changes and show that the model is responding to the appropriate causal factors. Finally, we demonstrate the model driving a car in the real world.

* Video results: https://sites.google.com/view/waymo-learn-to-drive

Via

Access Paper or Ask Questions

Learning Hand-Eye Coordination for Robotic Grasping with Deep Learning and Large-Scale Data Collection

Aug 28, 2016

Sergey Levine, Peter Pastor, Alex Krizhevsky, Deirdre Quillen

Figure 1 for Learning Hand-Eye Coordination for Robotic Grasping with Deep Learning and Large-Scale Data Collection

Figure 2 for Learning Hand-Eye Coordination for Robotic Grasping with Deep Learning and Large-Scale Data Collection

Figure 3 for Learning Hand-Eye Coordination for Robotic Grasping with Deep Learning and Large-Scale Data Collection

Figure 4 for Learning Hand-Eye Coordination for Robotic Grasping with Deep Learning and Large-Scale Data Collection

Abstract:We describe a learning-based approach to hand-eye coordination for robotic grasping from monocular images. To learn hand-eye coordination for grasping, we trained a large convolutional neural network to predict the probability that task-space motion of the gripper will result in successful grasps, using only monocular camera images and independently of camera calibration or the current robot pose. This requires the network to observe the spatial relationship between the gripper and objects in the scene, thus learning hand-eye coordination. We then use this network to servo the gripper in real time to achieve successful grasps. To train our network, we collected over 800,000 grasp attempts over the course of two months, using between 6 and 14 robotic manipulators at any given time, with differences in camera placement and hardware. Our experimental evaluation demonstrates that our method achieves effective real-time control, can successfully grasp novel objects, and corrects mistakes by continuous servoing.

* This is an extended version of "Learning Hand-Eye Coordination for Robotic Grasping with Large-Scale Data Collection," ISER 2016. Draft modified to correct typo in Algorithm 1 and add a link to the publicly available dataset

Via

Access Paper or Ask Questions

One weird trick for parallelizing convolutional neural networks

Apr 26, 2014

Alex Krizhevsky

Figure 1 for One weird trick for parallelizing convolutional neural networks

Figure 2 for One weird trick for parallelizing convolutional neural networks

Figure 3 for One weird trick for parallelizing convolutional neural networks

Abstract:I present a new way to parallelize the training of convolutional neural networks across multiple GPUs. The method scales significantly better than all alternatives when applied to modern convolutional neural networks.

Via

Access Paper or Ask Questions

Improving neural networks by preventing co-adaptation of feature detectors

Jul 03, 2012

Geoffrey E. Hinton, Nitish Srivastava, Alex Krizhevsky, Ilya Sutskever, Ruslan R. Salakhutdinov

Figure 1 for Improving neural networks by preventing co-adaptation of feature detectors

Figure 2 for Improving neural networks by preventing co-adaptation of feature detectors

Figure 3 for Improving neural networks by preventing co-adaptation of feature detectors

Figure 4 for Improving neural networks by preventing co-adaptation of feature detectors

Abstract:When a large feedforward neural network is trained on a small training set, it typically performs poorly on held-out test data. This "overfitting" is greatly reduced by randomly omitting half of the feature detectors on each training case. This prevents complex co-adaptations in which a feature detector is only helpful in the context of several other specific feature detectors. Instead, each neuron learns to detect a feature that is generally helpful for producing the correct answer given the combinatorially large variety of internal contexts in which it must operate. Random "dropout" gives big improvements on many benchmark tasks and sets new records for speech and object recognition.

Via

Access Paper or Ask Questions