Picture for Chenjia Bai

Chenjia Bai

ODRL: A Benchmark for Off-Dynamics Reinforcement Learning

Add code
Oct 28, 2024
Viaarxiv icon

Preference Aligned Diffusion Planner for Quadrupedal Locomotion Control

Add code
Oct 17, 2024
Viaarxiv icon

Task-agnostic Pre-training and Task-guided Fine-tuning for Versatile Diffusion Planner

Add code
Sep 30, 2024
Figure 1 for Task-agnostic Pre-training and Task-guided Fine-tuning for Versatile Diffusion Planner
Figure 2 for Task-agnostic Pre-training and Task-guided Fine-tuning for Versatile Diffusion Planner
Figure 3 for Task-agnostic Pre-training and Task-guided Fine-tuning for Versatile Diffusion Planner
Figure 4 for Task-agnostic Pre-training and Task-guided Fine-tuning for Versatile Diffusion Planner
Viaarxiv icon

Forward KL Regularized Preference Optimization for Aligning Diffusion Policies

Add code
Sep 09, 2024
Figure 1 for Forward KL Regularized Preference Optimization for Aligning Diffusion Policies
Figure 2 for Forward KL Regularized Preference Optimization for Aligning Diffusion Policies
Figure 3 for Forward KL Regularized Preference Optimization for Aligning Diffusion Policies
Figure 4 for Forward KL Regularized Preference Optimization for Aligning Diffusion Policies
Viaarxiv icon

SelfBC: Self Behavior Cloning for Offline Reinforcement Learning

Add code
Aug 04, 2024
Viaarxiv icon

Decentralized Transformers with Centralized Aggregation are Sample-Efficient Multi-Agent World Models

Add code
Jun 22, 2024
Viaarxiv icon

SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation

Add code
May 30, 2024
Viaarxiv icon

Constrained Ensemble Exploration for Unsupervised Skill Discovery

Add code
May 25, 2024
Viaarxiv icon

Cross-Domain Policy Adaptation by Capturing Representation Mismatch

Add code
May 24, 2024
Viaarxiv icon

Towards Efficient LLM Grounding for Embodied Multi-Agent Collaboration

Add code
May 23, 2024
Viaarxiv icon