Thomas Lampe

Preference Optimization as Probabilistic Inference

Oct 05, 2024
Via arXiv

Game On: Towards Language Models as RL Experimenters

Sep 05, 2024
Via arXiv

Offline Actor-Critic Reinforcement Learning Scales to Large Models

Feb 08, 2024
Via arXiv

Real-World Fluid Directed Rigid Body Control via Deep Reinforcement Learning

Feb 08, 2024
Via arXiv

Mastering Stacking of Diverse Shapes with Large-Scale Iterative Reinforcement Learning on Real Robots

Dec 18, 2023
Via arXiv

Replay across Experiments: A Natural Extension of Off-Policy RL

Nov 28, 2023
Via arXiv

RoboCat: A Self-Improving Foundation Agent for Robotic Manipulation

Jun 20, 2023
Via arXiv

SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended Exploration

Dec 03, 2022
Via arXiv

How to Spend Your Robot Time: Bridging Kickstarting and Offline Reinforcement Learning for Vision-based Robotic Manipulation

May 06, 2022
Via arXiv

Beyond Pick-and-Place: Tackling Robotic Stacking of Diverse Shapes

Nov 03, 2021
Via arXiv