Shixiang Gu

Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization

Jun 23, 2020

Emergent Real-World Robotic Skills via Unsupervised Off-Policy Reinforcement Learning

Apr 27, 2020

A Divergence Minimization Perspective on Imitation Learning Methods

Nov 06, 2019

Why Does Hierarchy Work So Well in Reinforcement Learning?

Sep 23, 2019

Multi-Agent Manipulation via Locomotion using Hierarchical Sim2Real

Aug 13, 2019

Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog

Jul 08, 2019

Dynamics-Aware Unsupervised Discovery of Skills

Jul 02, 2019

Language as an Abstraction for Hierarchical Deep Reinforcement Learning

Jun 18, 2019

Doubly Reparameterized Gradient Estimators for Monte Carlo Objectives

Oct 09, 2018

Data-Efficient Hierarchical Reinforcement Learning

Oct 05, 2018