Picture for Chenjia Bai

Chenjia Bai

VLP: Vision-Language Preference Learning for Embodied Manipulation

Add code
Feb 17, 2025
Viaarxiv icon

Radiology Report Generation via Multi-objective Preference Optimization

Add code
Dec 12, 2024
Figure 1 for Radiology Report Generation via Multi-objective Preference Optimization
Figure 2 for Radiology Report Generation via Multi-objective Preference Optimization
Figure 3 for Radiology Report Generation via Multi-objective Preference Optimization
Figure 4 for Radiology Report Generation via Multi-objective Preference Optimization
Viaarxiv icon

ODRL: A Benchmark for Off-Dynamics Reinforcement Learning

Add code
Oct 28, 2024
Figure 1 for ODRL: A Benchmark for Off-Dynamics Reinforcement Learning
Figure 2 for ODRL: A Benchmark for Off-Dynamics Reinforcement Learning
Figure 3 for ODRL: A Benchmark for Off-Dynamics Reinforcement Learning
Figure 4 for ODRL: A Benchmark for Off-Dynamics Reinforcement Learning
Viaarxiv icon

Preference Aligned Diffusion Planner for Quadrupedal Locomotion Control

Add code
Oct 17, 2024
Figure 1 for Preference Aligned Diffusion Planner for Quadrupedal Locomotion Control
Figure 2 for Preference Aligned Diffusion Planner for Quadrupedal Locomotion Control
Figure 3 for Preference Aligned Diffusion Planner for Quadrupedal Locomotion Control
Figure 4 for Preference Aligned Diffusion Planner for Quadrupedal Locomotion Control
Viaarxiv icon

Task-agnostic Pre-training and Task-guided Fine-tuning for Versatile Diffusion Planner

Add code
Sep 30, 2024
Figure 1 for Task-agnostic Pre-training and Task-guided Fine-tuning for Versatile Diffusion Planner
Figure 2 for Task-agnostic Pre-training and Task-guided Fine-tuning for Versatile Diffusion Planner
Figure 3 for Task-agnostic Pre-training and Task-guided Fine-tuning for Versatile Diffusion Planner
Figure 4 for Task-agnostic Pre-training and Task-guided Fine-tuning for Versatile Diffusion Planner
Viaarxiv icon

Forward KL Regularized Preference Optimization for Aligning Diffusion Policies

Add code
Sep 09, 2024
Figure 1 for Forward KL Regularized Preference Optimization for Aligning Diffusion Policies
Figure 2 for Forward KL Regularized Preference Optimization for Aligning Diffusion Policies
Figure 3 for Forward KL Regularized Preference Optimization for Aligning Diffusion Policies
Figure 4 for Forward KL Regularized Preference Optimization for Aligning Diffusion Policies
Viaarxiv icon

SelfBC: Self Behavior Cloning for Offline Reinforcement Learning

Add code
Aug 04, 2024
Viaarxiv icon

Decentralized Transformers with Centralized Aggregation are Sample-Efficient Multi-Agent World Models

Add code
Jun 22, 2024
Viaarxiv icon

SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation

Add code
May 30, 2024
Figure 1 for SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation
Figure 2 for SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation
Figure 3 for SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation
Figure 4 for SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation
Viaarxiv icon

Constrained Ensemble Exploration for Unsupervised Skill Discovery

Add code
May 25, 2024
Viaarxiv icon