Picture for Jiechao Xiong

Jiechao Xiong

Diverse Policies Recovering via Pointwise Mutual Information Weighted Imitation Learning

Add code
Oct 21, 2024
Figure 1 for Diverse Policies Recovering via Pointwise Mutual Information Weighted Imitation Learning
Figure 2 for Diverse Policies Recovering via Pointwise Mutual Information Weighted Imitation Learning
Figure 3 for Diverse Policies Recovering via Pointwise Mutual Information Weighted Imitation Learning
Figure 4 for Diverse Policies Recovering via Pointwise Mutual Information Weighted Imitation Learning
Viaarxiv icon

TLeague: A Framework for Competitive Self-Play based Distributed Multi-Agent Reinforcement Learning

Add code
Nov 30, 2020
Figure 1 for TLeague: A Framework for Competitive Self-Play based Distributed Multi-Agent Reinforcement Learning
Figure 2 for TLeague: A Framework for Competitive Self-Play based Distributed Multi-Agent Reinforcement Learning
Figure 3 for TLeague: A Framework for Competitive Self-Play based Distributed Multi-Agent Reinforcement Learning
Figure 4 for TLeague: A Framework for Competitive Self-Play based Distributed Multi-Agent Reinforcement Learning
Viaarxiv icon

TStarBot-X: An Open-Sourced and Comprehensive Study for Efficient League Training in StarCraft II Full Game

Add code
Nov 27, 2020
Figure 1 for TStarBot-X: An Open-Sourced and Comprehensive Study for Efficient League Training in StarCraft II Full Game
Figure 2 for TStarBot-X: An Open-Sourced and Comprehensive Study for Efficient League Training in StarCraft II Full Game
Figure 3 for TStarBot-X: An Open-Sourced and Comprehensive Study for Efficient League Training in StarCraft II Full Game
Figure 4 for TStarBot-X: An Open-Sourced and Comprehensive Study for Efficient League Training in StarCraft II Full Game
Viaarxiv icon

Zeroth-Order Supervised Policy Improvement

Add code
Jun 11, 2020
Figure 1 for Zeroth-Order Supervised Policy Improvement
Figure 2 for Zeroth-Order Supervised Policy Improvement
Figure 3 for Zeroth-Order Supervised Policy Improvement
Figure 4 for Zeroth-Order Supervised Policy Improvement
Viaarxiv icon

Arena: a toolkit for Multi-Agent Reinforcement Learning

Add code
Jul 20, 2019
Figure 1 for Arena: a toolkit for Multi-Agent Reinforcement Learning
Figure 2 for Arena: a toolkit for Multi-Agent Reinforcement Learning
Figure 3 for Arena: a toolkit for Multi-Agent Reinforcement Learning
Figure 4 for Arena: a toolkit for Multi-Agent Reinforcement Learning
Viaarxiv icon

TStarBots: Defeating the Cheating Level Builtin AI in StarCraft II in the Full Game

Add code
Nov 02, 2018
Figure 1 for TStarBots: Defeating the Cheating Level Builtin AI in StarCraft II in the Full Game
Figure 2 for TStarBots: Defeating the Cheating Level Builtin AI in StarCraft II in the Full Game
Figure 3 for TStarBots: Defeating the Cheating Level Builtin AI in StarCraft II in the Full Game
Figure 4 for TStarBots: Defeating the Cheating Level Builtin AI in StarCraft II in the Full Game
Viaarxiv icon

Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space

Add code
Oct 10, 2018
Figure 1 for Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space
Figure 2 for Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space
Figure 3 for Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space
Figure 4 for Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space
Viaarxiv icon

A Margin-based MLE for Crowdsourced Partial Ranking

Add code
Jul 29, 2018
Figure 1 for A Margin-based MLE for Crowdsourced Partial Ranking
Figure 2 for A Margin-based MLE for Crowdsourced Partial Ranking
Figure 3 for A Margin-based MLE for Crowdsourced Partial Ranking
Figure 4 for A Margin-based MLE for Crowdsourced Partial Ranking
Viaarxiv icon

From Social to Individuals: a Parsimonious Path of Multi-level Models for Crowdsourced Preference Aggregation

Add code
Mar 08, 2018
Figure 1 for From Social to Individuals: a Parsimonious Path of Multi-level Models for Crowdsourced Preference Aggregation
Figure 2 for From Social to Individuals: a Parsimonious Path of Multi-level Models for Crowdsourced Preference Aggregation
Figure 3 for From Social to Individuals: a Parsimonious Path of Multi-level Models for Crowdsourced Preference Aggregation
Figure 4 for From Social to Individuals: a Parsimonious Path of Multi-level Models for Crowdsourced Preference Aggregation
Viaarxiv icon

Stochastic Non-convex Ordinal Embedding with Stabilized Barzilai-Borwein Step Size

Add code
Jan 31, 2018
Figure 1 for Stochastic Non-convex Ordinal Embedding with Stabilized Barzilai-Borwein Step Size
Figure 2 for Stochastic Non-convex Ordinal Embedding with Stabilized Barzilai-Borwein Step Size
Figure 3 for Stochastic Non-convex Ordinal Embedding with Stabilized Barzilai-Borwein Step Size
Figure 4 for Stochastic Non-convex Ordinal Embedding with Stabilized Barzilai-Borwein Step Size
Viaarxiv icon