Picture for Jianhao Wang

Jianhao Wang

Offline Meta Reinforcement Learning with In-Distribution Online Adaptation

Add code
Jun 01, 2023
Viaarxiv icon

Latent-Variable Advantage-Weighted Policy Optimization for Offline RL

Add code
Mar 16, 2022
Figure 1 for Latent-Variable Advantage-Weighted Policy Optimization for Offline RL
Figure 2 for Latent-Variable Advantage-Weighted Policy Optimization for Offline RL
Figure 3 for Latent-Variable Advantage-Weighted Policy Optimization for Offline RL
Figure 4 for Latent-Variable Advantage-Weighted Policy Optimization for Offline RL
Viaarxiv icon

Self-Organized Polynomial-Time Coordination Graphs

Add code
Dec 07, 2021
Figure 1 for Self-Organized Polynomial-Time Coordination Graphs
Figure 2 for Self-Organized Polynomial-Time Coordination Graphs
Figure 3 for Self-Organized Polynomial-Time Coordination Graphs
Figure 4 for Self-Organized Polynomial-Time Coordination Graphs
Viaarxiv icon

Episodic Multi-agent Reinforcement Learning with Curiosity-Driven Exploration

Add code
Nov 22, 2021
Figure 1 for Episodic Multi-agent Reinforcement Learning with Curiosity-Driven Exploration
Figure 2 for Episodic Multi-agent Reinforcement Learning with Curiosity-Driven Exploration
Figure 3 for Episodic Multi-agent Reinforcement Learning with Curiosity-Driven Exploration
Figure 4 for Episodic Multi-agent Reinforcement Learning with Curiosity-Driven Exploration
Viaarxiv icon

LINDA: Multi-Agent Local Information Decomposition for Awareness of Teammates

Add code
Oct 15, 2021
Figure 1 for LINDA: Multi-Agent Local Information Decomposition for Awareness of Teammates
Figure 2 for LINDA: Multi-Agent Local Information Decomposition for Awareness of Teammates
Figure 3 for LINDA: Multi-Agent Local Information Decomposition for Awareness of Teammates
Figure 4 for LINDA: Multi-Agent Local Information Decomposition for Awareness of Teammates
Viaarxiv icon

Offline Reinforcement Learning with Reverse Model-based Imagination

Add code
Oct 01, 2021
Figure 1 for Offline Reinforcement Learning with Reverse Model-based Imagination
Figure 2 for Offline Reinforcement Learning with Reverse Model-based Imagination
Figure 3 for Offline Reinforcement Learning with Reverse Model-based Imagination
Figure 4 for Offline Reinforcement Learning with Reverse Model-based Imagination
Viaarxiv icon

Efficient Hierarchical Exploration with Stable Subgoal Representation Learning

Add code
May 31, 2021
Figure 1 for Efficient Hierarchical Exploration with Stable Subgoal Representation Learning
Figure 2 for Efficient Hierarchical Exploration with Stable Subgoal Representation Learning
Figure 3 for Efficient Hierarchical Exploration with Stable Subgoal Representation Learning
Figure 4 for Efficient Hierarchical Exploration with Stable Subgoal Representation Learning
Viaarxiv icon

QPLEX: Duplex Dueling Multi-Agent Q-Learning

Add code
Aug 03, 2020
Figure 1 for QPLEX: Duplex Dueling Multi-Agent Q-Learning
Figure 2 for QPLEX: Duplex Dueling Multi-Agent Q-Learning
Figure 3 for QPLEX: Duplex Dueling Multi-Agent Q-Learning
Figure 4 for QPLEX: Duplex Dueling Multi-Agent Q-Learning
Viaarxiv icon

Towards Understanding Linear Value Decomposition in Cooperative Multi-Agent Q-Learning

Add code
Jun 23, 2020
Figure 1 for Towards Understanding Linear Value Decomposition in Cooperative Multi-Agent Q-Learning
Figure 2 for Towards Understanding Linear Value Decomposition in Cooperative Multi-Agent Q-Learning
Figure 3 for Towards Understanding Linear Value Decomposition in Cooperative Multi-Agent Q-Learning
Viaarxiv icon

Learn to Effectively Explore in Context-Based Meta-RL

Add code
Jun 15, 2020
Figure 1 for Learn to Effectively Explore in Context-Based Meta-RL
Figure 2 for Learn to Effectively Explore in Context-Based Meta-RL
Figure 3 for Learn to Effectively Explore in Context-Based Meta-RL
Figure 4 for Learn to Effectively Explore in Context-Based Meta-RL
Viaarxiv icon