Picture for Michael Janner

Michael Janner

Tony

GPT-4o System Card

Add code
Oct 25, 2024
Viaarxiv icon

H-GAP: Humanoid Control with a Generalist Planner

Add code
Dec 05, 2023
Viaarxiv icon

Deep Generative Models for Decision-Making and Control

Add code
Jun 15, 2023
Figure 1 for Deep Generative Models for Decision-Making and Control
Figure 2 for Deep Generative Models for Decision-Making and Control
Figure 3 for Deep Generative Models for Decision-Making and Control
Figure 4 for Deep Generative Models for Decision-Making and Control
Viaarxiv icon

Training Diffusion Models with Reinforcement Learning

Add code
May 23, 2023
Viaarxiv icon

IDQL: Implicit Q-Learning as an Actor-Critic Method with Diffusion Policies

Add code
Apr 20, 2023
Figure 1 for IDQL: Implicit Q-Learning as an Actor-Critic Method with Diffusion Policies
Figure 2 for IDQL: Implicit Q-Learning as an Actor-Critic Method with Diffusion Policies
Figure 3 for IDQL: Implicit Q-Learning as an Actor-Critic Method with Diffusion Policies
Figure 4 for IDQL: Implicit Q-Learning as an Actor-Critic Method with Diffusion Policies
Viaarxiv icon

Efficient Planning in a Compact Latent Action Space

Add code
Aug 25, 2022
Figure 1 for Efficient Planning in a Compact Latent Action Space
Figure 2 for Efficient Planning in a Compact Latent Action Space
Figure 3 for Efficient Planning in a Compact Latent Action Space
Figure 4 for Efficient Planning in a Compact Latent Action Space
Viaarxiv icon

Lyapunov Density Models: Constraining Distribution Shift in Learning-Based Control

Add code
Jun 21, 2022
Figure 1 for Lyapunov Density Models: Constraining Distribution Shift in Learning-Based Control
Figure 2 for Lyapunov Density Models: Constraining Distribution Shift in Learning-Based Control
Figure 3 for Lyapunov Density Models: Constraining Distribution Shift in Learning-Based Control
Figure 4 for Lyapunov Density Models: Constraining Distribution Shift in Learning-Based Control
Viaarxiv icon

Planning with Diffusion for Flexible Behavior Synthesis

Add code
May 20, 2022
Figure 1 for Planning with Diffusion for Flexible Behavior Synthesis
Figure 2 for Planning with Diffusion for Flexible Behavior Synthesis
Figure 3 for Planning with Diffusion for Flexible Behavior Synthesis
Figure 4 for Planning with Diffusion for Flexible Behavior Synthesis
Viaarxiv icon

Reinforcement Learning as One Big Sequence Modeling Problem

Add code
Jun 03, 2021
Figure 1 for Reinforcement Learning as One Big Sequence Modeling Problem
Figure 2 for Reinforcement Learning as One Big Sequence Modeling Problem
Figure 3 for Reinforcement Learning as One Big Sequence Modeling Problem
Figure 4 for Reinforcement Learning as One Big Sequence Modeling Problem
Viaarxiv icon

$γ$-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction

Add code
Oct 27, 2020
Figure 1 for $γ$-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction
Figure 2 for $γ$-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction
Figure 3 for $γ$-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction
Figure 4 for $γ$-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction
Viaarxiv icon