Picture for Jiaxuan Gao

Jiaxuan Gao

Few-shot In-Context Preference Learning Using Large Language Models

Add code
Oct 22, 2024
Figure 1 for Few-shot In-Context Preference Learning Using Large Language Models
Figure 2 for Few-shot In-Context Preference Learning Using Large Language Models
Figure 3 for Few-shot In-Context Preference Learning Using Large Language Models
Figure 4 for Few-shot In-Context Preference Learning Using Large Language Models
Viaarxiv icon

On Designing Effective RL Reward at Training Time for LLM Reasoning

Add code
Oct 19, 2024
Figure 1 for On Designing Effective RL Reward at Training Time for LLM Reasoning
Figure 2 for On Designing Effective RL Reward at Training Time for LLM Reasoning
Figure 3 for On Designing Effective RL Reward at Training Time for LLM Reasoning
Figure 4 for On Designing Effective RL Reward at Training Time for LLM Reasoning
Viaarxiv icon

Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study

Add code
Apr 16, 2024
Viaarxiv icon

LLM-Powered Hierarchical Language Agent for Real-time Human-AI Coordination

Add code
Jan 09, 2024
Viaarxiv icon

Language-Guided Generation of Physically Realistic Robot Motion and Control

Add code
Jun 18, 2023
Figure 1 for Language-Guided Generation of Physically Realistic Robot Motion and Control
Figure 2 for Language-Guided Generation of Physically Realistic Robot Motion and Control
Figure 3 for Language-Guided Generation of Physically Realistic Robot Motion and Control
Figure 4 for Language-Guided Generation of Physically Realistic Robot Motion and Control
Viaarxiv icon

Learning Zero-Shot Cooperation with Humans, Assuming Humans Are Biased

Add code
Feb 03, 2023
Figure 1 for Learning Zero-Shot Cooperation with Humans, Assuming Humans Are Biased
Figure 2 for Learning Zero-Shot Cooperation with Humans, Assuming Humans Are Biased
Figure 3 for Learning Zero-Shot Cooperation with Humans, Assuming Humans Are Biased
Figure 4 for Learning Zero-Shot Cooperation with Humans, Assuming Humans Are Biased
Viaarxiv icon

Asynchronous Multi-Agent Reinforcement Learning for Efficient Real-Time Multi-Robot Cooperative Exploration

Add code
Jan 09, 2023
Figure 1 for Asynchronous Multi-Agent Reinforcement Learning for Efficient Real-Time Multi-Robot Cooperative Exploration
Figure 2 for Asynchronous Multi-Agent Reinforcement Learning for Efficient Real-Time Multi-Robot Cooperative Exploration
Figure 3 for Asynchronous Multi-Agent Reinforcement Learning for Efficient Real-Time Multi-Robot Cooperative Exploration
Figure 4 for Asynchronous Multi-Agent Reinforcement Learning for Efficient Real-Time Multi-Robot Cooperative Exploration
Viaarxiv icon

Learning Efficient Multi-Agent Cooperative Visual Exploration

Add code
Oct 12, 2021
Figure 1 for Learning Efficient Multi-Agent Cooperative Visual Exploration
Figure 2 for Learning Efficient Multi-Agent Cooperative Visual Exploration
Figure 3 for Learning Efficient Multi-Agent Cooperative Visual Exploration
Figure 4 for Learning Efficient Multi-Agent Cooperative Visual Exploration
Viaarxiv icon