Jiaxuan Gao

Few-shot In-Context Preference Learning Using Large Language Models

Oct 22, 2024
Via arXiv

On Designing Effective RL Reward at Training Time for LLM Reasoning

Oct 19, 2024
Via arXiv

Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study

Apr 16, 2024
Via arXiv

LLM-Powered Hierarchical Language Agent for Real-time Human-AI Coordination

Jan 09, 2024
Via arXiv

Language-Guided Generation of Physically Realistic Robot Motion and Control

Jun 18, 2023
Via arXiv

Learning Zero-Shot Cooperation with Humans, Assuming Humans Are Biased

Feb 03, 2023
Via arXiv

Asynchronous Multi-Agent Reinforcement Learning for Efficient Real-Time Multi-Robot Cooperative Exploration

Jan 09, 2023
Via arXiv

Learning Efficient Multi-Agent Cooperative Visual Exploration

Oct 12, 2021
Via arXiv