Picture for Zhongwen Xu

Zhongwen Xu

Cleanba: A Reproducible and Efficient Distributed Reinforcement Learning Platform

Add code
Sep 29, 2023
Figure 1 for Cleanba: A Reproducible and Efficient Distributed Reinforcement Learning Platform
Figure 2 for Cleanba: A Reproducible and Efficient Distributed Reinforcement Learning Platform
Figure 3 for Cleanba: A Reproducible and Efficient Distributed Reinforcement Learning Platform
Figure 4 for Cleanba: A Reproducible and Efficient Distributed Reinforcement Learning Platform
Viaarxiv icon

Visual Imitation Learning with Patch Rewards

Add code
Feb 10, 2023
Figure 1 for Visual Imitation Learning with Patch Rewards
Figure 2 for Visual Imitation Learning with Patch Rewards
Figure 3 for Visual Imitation Learning with Patch Rewards
Figure 4 for Visual Imitation Learning with Patch Rewards
Viaarxiv icon

Learning to Optimize for Reinforcement Learning

Add code
Feb 03, 2023
Viaarxiv icon

Reinforcement Learning from Diverse Human Preferences

Add code
Jan 30, 2023
Viaarxiv icon

Benchmarking Deformable Object Manipulation with Differentiable Physics

Add code
Oct 24, 2022
Viaarxiv icon

RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning

Add code
Oct 18, 2022
Figure 1 for RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning
Figure 2 for RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning
Figure 3 for RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning
Figure 4 for RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning
Viaarxiv icon

Boosting Offline Reinforcement Learning via Data Rebalancing

Add code
Oct 17, 2022
Figure 1 for Boosting Offline Reinforcement Learning via Data Rebalancing
Figure 2 for Boosting Offline Reinforcement Learning via Data Rebalancing
Figure 3 for Boosting Offline Reinforcement Learning via Data Rebalancing
Figure 4 for Boosting Offline Reinforcement Learning via Data Rebalancing
Viaarxiv icon

Mutual Information Regularized Offline Reinforcement Learning

Add code
Oct 14, 2022
Figure 1 for Mutual Information Regularized Offline Reinforcement Learning
Figure 2 for Mutual Information Regularized Offline Reinforcement Learning
Figure 3 for Mutual Information Regularized Offline Reinforcement Learning
Figure 4 for Mutual Information Regularized Offline Reinforcement Learning
Viaarxiv icon

Efficient Offline Policy Optimization with a Learned Model

Add code
Oct 12, 2022
Figure 1 for Efficient Offline Policy Optimization with a Learned Model
Figure 2 for Efficient Offline Policy Optimization with a Learned Model
Figure 3 for Efficient Offline Policy Optimization with a Learned Model
Figure 4 for Efficient Offline Policy Optimization with a Learned Model
Viaarxiv icon

Value-Consistent Representation Learning for Data-Efficient Reinforcement Learning

Add code
Jun 25, 2022
Figure 1 for Value-Consistent Representation Learning for Data-Efficient Reinforcement Learning
Figure 2 for Value-Consistent Representation Learning for Data-Efficient Reinforcement Learning
Figure 3 for Value-Consistent Representation Learning for Data-Efficient Reinforcement Learning
Figure 4 for Value-Consistent Representation Learning for Data-Efficient Reinforcement Learning
Viaarxiv icon