Picture for Hengyuan Hu

Hengyuan Hu

What's the Move? Hybrid Imitation Learning via Salient Points

Add code
Dec 06, 2024
Viaarxiv icon

Imitation Bootstrapped Reinforcement Learning

Add code
Nov 20, 2023
Viaarxiv icon

Toward Grounded Social Reasoning

Add code
Jun 14, 2023
Figure 1 for Toward Grounded Social Reasoning
Figure 2 for Toward Grounded Social Reasoning
Figure 3 for Toward Grounded Social Reasoning
Figure 4 for Toward Grounded Social Reasoning
Viaarxiv icon

The Update Equivalence Framework for Decision-Time Planning

Add code
Apr 25, 2023
Figure 1 for The Update Equivalence Framework for Decision-Time Planning
Figure 2 for The Update Equivalence Framework for Decision-Time Planning
Figure 3 for The Update Equivalence Framework for Decision-Time Planning
Figure 4 for The Update Equivalence Framework for Decision-Time Planning
Viaarxiv icon

Language Instructed Reinforcement Learning for Human-AI Coordination

Add code
Apr 13, 2023
Viaarxiv icon

Human-AI Coordination via Human-Regularized Search and Learning

Add code
Oct 11, 2022
Figure 1 for Human-AI Coordination via Human-Regularized Search and Learning
Figure 2 for Human-AI Coordination via Human-Regularized Search and Learning
Figure 3 for Human-AI Coordination via Human-Regularized Search and Learning
Viaarxiv icon

K-level Reasoning for Zero-Shot Coordination in Hanabi

Add code
Jul 14, 2022
Figure 1 for K-level Reasoning for Zero-Shot Coordination in Hanabi
Figure 2 for K-level Reasoning for Zero-Shot Coordination in Hanabi
Figure 3 for K-level Reasoning for Zero-Shot Coordination in Hanabi
Figure 4 for K-level Reasoning for Zero-Shot Coordination in Hanabi
Viaarxiv icon

Scalable Online Planning via Reinforcement Learning Fine-Tuning

Add code
Sep 30, 2021
Figure 1 for Scalable Online Planning via Reinforcement Learning Fine-Tuning
Figure 2 for Scalable Online Planning via Reinforcement Learning Fine-Tuning
Figure 3 for Scalable Online Planning via Reinforcement Learning Fine-Tuning
Figure 4 for Scalable Online Planning via Reinforcement Learning Fine-Tuning
Viaarxiv icon

Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings

Add code
Jun 16, 2021
Figure 1 for Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings
Figure 2 for Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings
Figure 3 for Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings
Figure 4 for Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings
Viaarxiv icon

Off-Belief Learning

Add code
Mar 06, 2021
Figure 1 for Off-Belief Learning
Figure 2 for Off-Belief Learning
Figure 3 for Off-Belief Learning
Figure 4 for Off-Belief Learning
Viaarxiv icon