Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Quantile QT-Opt for Risk-Aware Vision-Based Robotic Grasping

Oct 01, 2019

Cristian Bodnar, Adrian Li, Karol Hausman, Peter Pastor, Mrinal Kalakrishnan

Figure 1 for Quantile QT-Opt for Risk-Aware Vision-Based Robotic Grasping

Figure 2 for Quantile QT-Opt for Risk-Aware Vision-Based Robotic Grasping

Figure 3 for Quantile QT-Opt for Risk-Aware Vision-Based Robotic Grasping

Figure 4 for Quantile QT-Opt for Risk-Aware Vision-Based Robotic Grasping

Share this with someone who'll enjoy it:

Abstract:The distributional perspective on reinforcement learning (RL) has given rise to a series of successful Q-learning algorithms, resulting in state-of-the-art performance in arcade game environments. However, it has not yet been analyzed how these findings from a discrete setting translate to complex practical applications characterized by noisy, high dimensional and continuous state-action spaces. In this work, we propose Quantile QT-Opt (Q2-Opt), a distributional variant of the recently introduced distributed Q-learning algorithm for continuous domains, and examine its behaviour in a series of simulated and real vision-based robotic grasping tasks. The absence of an actor in Q2-Opt allows us to directly draw a parallel to the previous discrete experiments in the literature without the additional complexities induced by an actor-critic architecture. We demonstrate that Q2-Opt achieves a superior vision-based object grasping success rate, while also being more sample efficient. The distributional formulation also allows us to experiment with various risk-distortion metrics that give us an indication of how robots can concretely manage risk in practice using a Deep RL control policy. As an additional contribution, we perform experiments on offline datasets and compare them with the latest findings from discrete settings. Surprisingly, we find that there is a discrepancy between our results and the previous batch RL findings from the literature obtained on arcade game environments.

* Under review at ICRA 2020

View paper on

Share this with someone who'll enjoy it:

Title:Quantile QT-Opt for Risk-Aware Vision-Based Robotic Grasping

Paper and Code