Picture for Haobo Fu

Haobo Fu

Diverse Policies Recovering via Pointwise Mutual Information Weighted Imitation Learning

Add code
Oct 21, 2024
Viaarxiv icon

Minimizing Weighted Counterfactual Regret with Optimistic Online Mirror Descent

Add code
Apr 22, 2024
Viaarxiv icon

Reaching Consensus in Cooperative Multi-Agent Reinforcement Learning with Goal Imagination

Add code
Mar 05, 2024
Viaarxiv icon

Enhance Reasoning for Large Language Models in the Game Werewolf

Add code
Feb 04, 2024
Viaarxiv icon

Not All Tasks Are Equally Difficult: Multi-Task Reinforcement Learning with Dynamic Depth Routing

Add code
Dec 22, 2023
Viaarxiv icon

Pointer Networks Trained Better via Evolutionary Algorithms

Add code
Dec 06, 2023
Figure 1 for Pointer Networks Trained Better via Evolutionary Algorithms
Figure 2 for Pointer Networks Trained Better via Evolutionary Algorithms
Figure 3 for Pointer Networks Trained Better via Evolutionary Algorithms
Figure 4 for Pointer Networks Trained Better via Evolutionary Algorithms
Viaarxiv icon

Diversity from Human Feedback

Add code
Oct 10, 2023
Viaarxiv icon

Policy Space Diversity for Non-Transitive Games

Add code
Jun 29, 2023
Viaarxiv icon

Maximum Entropy Heterogeneous-Agent Mirror Learning

Add code
Jun 19, 2023
Viaarxiv icon

L2E: Learning to Exploit Your Opponent

Add code
Feb 18, 2021
Figure 1 for L2E: Learning to Exploit Your Opponent
Figure 2 for L2E: Learning to Exploit Your Opponent
Figure 3 for L2E: Learning to Exploit Your Opponent
Figure 4 for L2E: Learning to Exploit Your Opponent
Viaarxiv icon