Jost Tobias Springenberg

Preference Optimization as Probabilistic Inference

Oct 05, 2024
Via arXiv

Game On: Towards Language Models as RL Experimenters

Sep 05, 2024
Via arXiv

Imitating Language via Scalable Inverse Reinforcement Learning

Sep 02, 2024
Via arXiv

Offline Actor-Critic Reinforcement Learning Scales to Large Models

Feb 08, 2024
Via arXiv

GATS: Gather-Attend-Scatter

Jan 16, 2024
Via arXiv

Mastering Stacking of Diverse Shapes with Large-Scale Iterative Reinforcement Learning on Real Robots

Dec 18, 2023
Via arXiv

RoboCat: A Self-Improving Foundation Agent for Robotic Manipulation

Jun 20, 2023
Via arXiv

A Generalist Dynamics Model for Control

May 18, 2023
Via arXiv

Leveraging Jumpy Models for Planning and Fast Learning in Robotic Domains

Feb 24, 2023
Via arXiv

A Generalist Agent

May 19, 2022
Via arXiv