Picture for Jerry Zhu

Jerry Zhu

The Delusional Hedge Algorithm as a Model of Human Learning from Diverse Opinions

Add code
Feb 21, 2024
Viaarxiv icon

Policy Gradient Bayesian Robust Optimization for Imitation Learning

Add code
Jun 21, 2021
Figure 1 for Policy Gradient Bayesian Robust Optimization for Imitation Learning
Figure 2 for Policy Gradient Bayesian Robust Optimization for Imitation Learning
Figure 3 for Policy Gradient Bayesian Robust Optimization for Imitation Learning
Figure 4 for Policy Gradient Bayesian Robust Optimization for Imitation Learning
Viaarxiv icon

Corruption-Robust Offline Reinforcement Learning

Add code
Jun 11, 2021
Figure 1 for Corruption-Robust Offline Reinforcement Learning
Viaarxiv icon