Picture for Xianyuan Zhan

Xianyuan Zhan

Are Expressive Models Truly Necessary for Offline RL?

Add code
Dec 15, 2024
Viaarxiv icon

Robo-MUTUAL: Robotic Multimodal Task Specification via Unimodal Learning

Add code
Oct 02, 2024
Viaarxiv icon

xTED: Cross-Domain Policy Adaptation via Diffusion-Based Trajectory Editing

Add code
Sep 13, 2024
Viaarxiv icon

Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning

Add code
Jul 29, 2024
Figure 1 for Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning
Figure 2 for Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning
Figure 3 for Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning
Figure 4 for Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning
Viaarxiv icon

Bidirectional-Reachable Hierarchical Reinforcement Learning with Mutually Responsive Policies

Add code
Jun 26, 2024
Viaarxiv icon

Instruction-Guided Visual Masking

Add code
May 30, 2024
Figure 1 for Instruction-Guided Visual Masking
Figure 2 for Instruction-Guided Visual Masking
Figure 3 for Instruction-Guided Visual Masking
Figure 4 for Instruction-Guided Visual Masking
Viaarxiv icon

OMPO: A Unified Framework for RL under Policy and Dynamics Shifts

Add code
May 29, 2024
Figure 1 for OMPO: A Unified Framework for RL under Policy and Dynamics Shifts
Figure 2 for OMPO: A Unified Framework for RL under Policy and Dynamics Shifts
Figure 3 for OMPO: A Unified Framework for RL under Policy and Dynamics Shifts
Figure 4 for OMPO: A Unified Framework for RL under Policy and Dynamics Shifts
Viaarxiv icon

Offline-Boosted Actor-Critic: Adaptively Blending Optimal Historical Behaviors in Deep Off-Policy RL

Add code
May 28, 2024
Viaarxiv icon

Policy Bifurcation in Safe Reinforcement Learning

Add code
Mar 28, 2024
Figure 1 for Policy Bifurcation in Safe Reinforcement Learning
Figure 2 for Policy Bifurcation in Safe Reinforcement Learning
Figure 3 for Policy Bifurcation in Safe Reinforcement Learning
Figure 4 for Policy Bifurcation in Safe Reinforcement Learning
Viaarxiv icon

DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning

Add code
Feb 28, 2024
Viaarxiv icon