Picture for Scott Fujimoto

Scott Fujimoto

Imbalanced Gradients in RL Post-Training of Multi-Task LLMs

Add code
Oct 22, 2025
Viaarxiv icon

Towards General-Purpose Model-Free Reinforcement Learning

Add code
Jan 27, 2025
Figure 1 for Towards General-Purpose Model-Free Reinforcement Learning
Figure 2 for Towards General-Purpose Model-Free Reinforcement Learning
Figure 3 for Towards General-Purpose Model-Free Reinforcement Learning
Figure 4 for Towards General-Purpose Model-Free Reinforcement Learning
Viaarxiv icon

Fairness in Reinforcement Learning with Bisimulation Metrics

Add code
Dec 22, 2024
Figure 1 for Fairness in Reinforcement Learning with Bisimulation Metrics
Figure 2 for Fairness in Reinforcement Learning with Bisimulation Metrics
Figure 3 for Fairness in Reinforcement Learning with Bisimulation Metrics
Figure 4 for Fairness in Reinforcement Learning with Bisimulation Metrics
Viaarxiv icon

Exploiting Structure in Offline Multi-Agent RL: The Benefits of Low Interaction Rank

Add code
Oct 01, 2024
Figure 1 for Exploiting Structure in Offline Multi-Agent RL: The Benefits of Low Interaction Rank
Figure 2 for Exploiting Structure in Offline Multi-Agent RL: The Benefits of Low Interaction Rank
Figure 3 for Exploiting Structure in Offline Multi-Agent RL: The Benefits of Low Interaction Rank
Figure 4 for Exploiting Structure in Offline Multi-Agent RL: The Benefits of Low Interaction Rank
Viaarxiv icon

Imitation Learning from Observation through Optimal Transport

Add code
Oct 02, 2023
Figure 1 for Imitation Learning from Observation through Optimal Transport
Figure 2 for Imitation Learning from Observation through Optimal Transport
Figure 3 for Imitation Learning from Observation through Optimal Transport
Figure 4 for Imitation Learning from Observation through Optimal Transport
Viaarxiv icon

For SALE: State-Action Representation Learning for Deep Reinforcement Learning

Add code
Jun 04, 2023
Figure 1 for For SALE: State-Action Representation Learning for Deep Reinforcement Learning
Figure 2 for For SALE: State-Action Representation Learning for Deep Reinforcement Learning
Figure 3 for For SALE: State-Action Representation Learning for Deep Reinforcement Learning
Figure 4 for For SALE: State-Action Representation Learning for Deep Reinforcement Learning
Viaarxiv icon

IL-flOw: Imitation Learning from Observation using Normalizing Flows

Add code
May 19, 2022
Figure 1 for IL-flOw: Imitation Learning from Observation using Normalizing Flows
Figure 2 for IL-flOw: Imitation Learning from Observation using Normalizing Flows
Figure 3 for IL-flOw: Imitation Learning from Observation using Normalizing Flows
Figure 4 for IL-flOw: Imitation Learning from Observation using Normalizing Flows
Viaarxiv icon

Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error

Add code
Jan 28, 2022
Figure 1 for Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error
Figure 2 for Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error
Figure 3 for Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error
Figure 4 for Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error
Viaarxiv icon

A Minimalist Approach to Offline Reinforcement Learning

Add code
Jun 12, 2021
Figure 1 for A Minimalist Approach to Offline Reinforcement Learning
Figure 2 for A Minimalist Approach to Offline Reinforcement Learning
Figure 3 for A Minimalist Approach to Offline Reinforcement Learning
Figure 4 for A Minimalist Approach to Offline Reinforcement Learning
Viaarxiv icon

A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation

Add code
Jun 12, 2021
Figure 1 for A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation
Figure 2 for A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation
Figure 3 for A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation
Figure 4 for A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation
Viaarxiv icon