Picture for Dongbin Zhao

Dongbin Zhao

Online Preference-based Reinforcement Learning with Self-augmented Feedback from Large Language Model

Add code
Dec 22, 2024
Viaarxiv icon

In-Dataset Trajectory Return Regularization for Offline Preference-based Reinforcement Learning

Add code
Dec 12, 2024
Viaarxiv icon

Preliminary Investigation into Data Scaling Laws for Imitation Learning-Based End-to-End Autonomous Driving

Add code
Dec 03, 2024
Viaarxiv icon

Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement

Add code
Oct 15, 2024
Viaarxiv icon

SELU: Self-Learning Embodied MLLMs in Unknown Environments

Add code
Oct 04, 2024
Viaarxiv icon

Discretizing Continuous Action Space with Unimodal Probability Distributions for On-Policy Reinforcement Learning

Add code
Aug 01, 2024
Viaarxiv icon

PlanAgent: A Multi-modal Large Language Agent for Closed-loop Vehicle Motion Planning

Add code
Jun 04, 2024
Figure 1 for PlanAgent: A Multi-modal Large Language Agent for Closed-loop Vehicle Motion Planning
Figure 2 for PlanAgent: A Multi-modal Large Language Agent for Closed-loop Vehicle Motion Planning
Figure 3 for PlanAgent: A Multi-modal Large Language Agent for Closed-loop Vehicle Motion Planning
Figure 4 for PlanAgent: A Multi-modal Large Language Agent for Closed-loop Vehicle Motion Planning
Viaarxiv icon

Learning Future Representation with Synthetic Observations for Sample-efficient Reinforcement Learning

Add code
May 20, 2024
Viaarxiv icon

Advancing Object Goal Navigation Through LLM-enhanced Object Affinities Transfer

Add code
Mar 15, 2024
Viaarxiv icon

FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game

Add code
Feb 01, 2024
Viaarxiv icon