Picture for Dongbin Zhao

Dongbin Zhao

ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy

Add code
Feb 08, 2025
Viaarxiv icon

Dream to Drive with Predictive Individual World Model

Add code
Jan 28, 2025
Viaarxiv icon

Online Preference-based Reinforcement Learning with Self-augmented Feedback from Large Language Model

Add code
Dec 22, 2024
Viaarxiv icon

In-Dataset Trajectory Return Regularization for Offline Preference-based Reinforcement Learning

Add code
Dec 12, 2024
Viaarxiv icon

Preliminary Investigation into Data Scaling Laws for Imitation Learning-Based End-to-End Autonomous Driving

Add code
Dec 03, 2024
Viaarxiv icon

Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement

Add code
Oct 15, 2024
Viaarxiv icon

SELU: Self-Learning Embodied MLLMs in Unknown Environments

Add code
Oct 04, 2024
Viaarxiv icon

Discretizing Continuous Action Space with Unimodal Probability Distributions for On-Policy Reinforcement Learning

Add code
Aug 01, 2024
Viaarxiv icon

PlanAgent: A Multi-modal Large Language Agent for Closed-loop Vehicle Motion Planning

Add code
Jun 04, 2024
Figure 1 for PlanAgent: A Multi-modal Large Language Agent for Closed-loop Vehicle Motion Planning
Figure 2 for PlanAgent: A Multi-modal Large Language Agent for Closed-loop Vehicle Motion Planning
Figure 3 for PlanAgent: A Multi-modal Large Language Agent for Closed-loop Vehicle Motion Planning
Figure 4 for PlanAgent: A Multi-modal Large Language Agent for Closed-loop Vehicle Motion Planning
Viaarxiv icon

Learning Future Representation with Synthetic Observations for Sample-efficient Reinforcement Learning

Add code
May 20, 2024
Viaarxiv icon