Picture for Dhruva Tirumala

Dhruva Tirumala

Learning Robot Soccer from Egocentric Vision with Deep Reinforcement Learning

Add code
May 03, 2024
Viaarxiv icon

Replay across Experiments: A Natural Extension of Off-Policy RL

Add code
Nov 28, 2023
Viaarxiv icon

Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning

Add code
Apr 26, 2023
Viaarxiv icon

SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended Exploration

Add code
Dec 03, 2022
Figure 1 for SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended Exploration
Figure 2 for SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended Exploration
Figure 3 for SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended Exploration
Figure 4 for SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended Exploration
Viaarxiv icon

MO2: Model-Based Offline Options

Add code
Sep 05, 2022
Figure 1 for MO2: Model-Based Offline Options
Figure 2 for MO2: Model-Based Offline Options
Figure 3 for MO2: Model-Based Offline Options
Figure 4 for MO2: Model-Based Offline Options
Viaarxiv icon

Learning Transferable Motor Skills with Hierarchical Latent Mixture Policies

Add code
Dec 09, 2021
Figure 1 for Learning Transferable Motor Skills with Hierarchical Latent Mixture Policies
Figure 2 for Learning Transferable Motor Skills with Hierarchical Latent Mixture Policies
Figure 3 for Learning Transferable Motor Skills with Hierarchical Latent Mixture Policies
Figure 4 for Learning Transferable Motor Skills with Hierarchical Latent Mixture Policies
Viaarxiv icon

Pick Your Battles: Interaction Graphs as Population-Level Objectives for Strategic Diversity

Add code
Oct 08, 2021
Viaarxiv icon

Behavior Priors for Efficient Reinforcement Learning

Add code
Oct 27, 2020
Figure 1 for Behavior Priors for Efficient Reinforcement Learning
Figure 2 for Behavior Priors for Efficient Reinforcement Learning
Figure 3 for Behavior Priors for Efficient Reinforcement Learning
Figure 4 for Behavior Priors for Efficient Reinforcement Learning
Viaarxiv icon

Data-efficient Hindsight Off-policy Option Learning

Add code
Jul 30, 2020
Figure 1 for Data-efficient Hindsight Off-policy Option Learning
Figure 2 for Data-efficient Hindsight Off-policy Option Learning
Figure 3 for Data-efficient Hindsight Off-policy Option Learning
Figure 4 for Data-efficient Hindsight Off-policy Option Learning
Viaarxiv icon

V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control

Add code
Sep 26, 2019
Figure 1 for V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
Figure 2 for V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
Figure 3 for V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
Figure 4 for V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
Viaarxiv icon