Xiaoteng Ma

Efficient Multi-agent Reinforcement Learning by Planning

May 20, 2024

SEABO: A Simple Search-Based Method for Offline Imitation Learning

Feb 06, 2024

What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL?

Jun 02, 2023

Cross-Domain Policy Adaptation via Value-Guided Data Filtering

May 28, 2023

Learning Diverse Risk Preferences in Population-based Self-play

May 19, 2023

Uncertainty-driven Trajectory Truncation for Model-based Offline Reinforcement Learning

Apr 10, 2023

Single-Trajectory Distributionally Robust Reinforcement Learning

Jan 27, 2023

Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation

Sep 29, 2022

Exploiting Reward Shifting in Value-Based Deep RL

Sep 15, 2022

Mean-Semivariance Policy Optimization via Risk-Averse Reinforcement Learning

Jun 15, 2022