Picture for Hengyuan Hu

Hengyuan Hu

What's the Move? Hybrid Imitation Learning via Salient Points

Add code
Dec 06, 2024
Figure 1 for What's the Move? Hybrid Imitation Learning via Salient Points
Figure 2 for What's the Move? Hybrid Imitation Learning via Salient Points
Figure 3 for What's the Move? Hybrid Imitation Learning via Salient Points
Figure 4 for What's the Move? Hybrid Imitation Learning via Salient Points
Viaarxiv icon

Imitation Bootstrapped Reinforcement Learning

Add code
Nov 20, 2023
Figure 1 for Imitation Bootstrapped Reinforcement Learning
Figure 2 for Imitation Bootstrapped Reinforcement Learning
Figure 3 for Imitation Bootstrapped Reinforcement Learning
Figure 4 for Imitation Bootstrapped Reinforcement Learning
Viaarxiv icon

Toward Grounded Social Reasoning

Add code
Jun 14, 2023
Figure 1 for Toward Grounded Social Reasoning
Figure 2 for Toward Grounded Social Reasoning
Figure 3 for Toward Grounded Social Reasoning
Figure 4 for Toward Grounded Social Reasoning
Viaarxiv icon

The Update Equivalence Framework for Decision-Time Planning

Add code
Apr 25, 2023
Figure 1 for The Update Equivalence Framework for Decision-Time Planning
Figure 2 for The Update Equivalence Framework for Decision-Time Planning
Figure 3 for The Update Equivalence Framework for Decision-Time Planning
Figure 4 for The Update Equivalence Framework for Decision-Time Planning
Viaarxiv icon

Language Instructed Reinforcement Learning for Human-AI Coordination

Add code
Apr 13, 2023
Viaarxiv icon

Human-AI Coordination via Human-Regularized Search and Learning

Add code
Oct 11, 2022
Figure 1 for Human-AI Coordination via Human-Regularized Search and Learning
Figure 2 for Human-AI Coordination via Human-Regularized Search and Learning
Figure 3 for Human-AI Coordination via Human-Regularized Search and Learning
Viaarxiv icon

K-level Reasoning for Zero-Shot Coordination in Hanabi

Add code
Jul 14, 2022
Figure 1 for K-level Reasoning for Zero-Shot Coordination in Hanabi
Figure 2 for K-level Reasoning for Zero-Shot Coordination in Hanabi
Figure 3 for K-level Reasoning for Zero-Shot Coordination in Hanabi
Figure 4 for K-level Reasoning for Zero-Shot Coordination in Hanabi
Viaarxiv icon

Scalable Online Planning via Reinforcement Learning Fine-Tuning

Add code
Sep 30, 2021
Figure 1 for Scalable Online Planning via Reinforcement Learning Fine-Tuning
Figure 2 for Scalable Online Planning via Reinforcement Learning Fine-Tuning
Figure 3 for Scalable Online Planning via Reinforcement Learning Fine-Tuning
Figure 4 for Scalable Online Planning via Reinforcement Learning Fine-Tuning
Viaarxiv icon

Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings

Add code
Jun 16, 2021
Figure 1 for Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings
Figure 2 for Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings
Figure 3 for Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings
Figure 4 for Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings
Viaarxiv icon

Off-Belief Learning

Add code
Mar 06, 2021
Figure 1 for Off-Belief Learning
Figure 2 for Off-Belief Learning
Figure 3 for Off-Belief Learning
Figure 4 for Off-Belief Learning
Viaarxiv icon