Picture for Pan Xu

Pan Xu

Off-Dynamics Reinforcement Learning via Domain Adaptation and Reward Augmented Imitation

Add code
Nov 15, 2024
Figure 1 for Off-Dynamics Reinforcement Learning via Domain Adaptation and Reward Augmented Imitation
Figure 2 for Off-Dynamics Reinforcement Learning via Domain Adaptation and Reward Augmented Imitation
Figure 3 for Off-Dynamics Reinforcement Learning via Domain Adaptation and Reward Augmented Imitation
Figure 4 for Off-Dynamics Reinforcement Learning via Domain Adaptation and Reward Augmented Imitation
Viaarxiv icon

Return Augmented Decision Transformer for Off-Dynamics Reinforcement Learning

Add code
Oct 30, 2024
Viaarxiv icon

Upper and Lower Bounds for Distributionally Robust Off-Dynamics Reinforcement Learning

Add code
Sep 30, 2024
Figure 1 for Upper and Lower Bounds for Distributionally Robust Off-Dynamics Reinforcement Learning
Figure 2 for Upper and Lower Bounds for Distributionally Robust Off-Dynamics Reinforcement Learning
Figure 3 for Upper and Lower Bounds for Distributionally Robust Off-Dynamics Reinforcement Learning
Figure 4 for Upper and Lower Bounds for Distributionally Robust Off-Dynamics Reinforcement Learning
Viaarxiv icon

Pre-trained Language Models Improve the Few-shot Prompt Ability of Decision Transformer

Add code
Aug 02, 2024
Viaarxiv icon

More Efficient Randomized Exploration for Reinforcement Learning via Approximate Sampling

Add code
Jun 18, 2024
Viaarxiv icon

Optimal Batched Linear Bandits

Add code
Jun 06, 2024
Viaarxiv icon

Randomized Exploration in Cooperative Multi-Agent Reinforcement Learning

Add code
Apr 16, 2024
Figure 1 for Randomized Exploration in Cooperative Multi-Agent Reinforcement Learning
Figure 2 for Randomized Exploration in Cooperative Multi-Agent Reinforcement Learning
Figure 3 for Randomized Exploration in Cooperative Multi-Agent Reinforcement Learning
Figure 4 for Randomized Exploration in Cooperative Multi-Agent Reinforcement Learning
Viaarxiv icon

Minimax Optimal and Computationally Efficient Algorithms for Distributionally Robust Offline Reinforcement Learning

Add code
Mar 14, 2024
Viaarxiv icon

Distributionally Robust Off-Dynamics Reinforcement Learning: Provable Efficiency with Linear Function Approximation

Add code
Feb 23, 2024
Viaarxiv icon

Finite-Time Frequentist Regret Bounds of Multi-Agent Thompson Sampling on Sparse Hypergraphs

Add code
Dec 24, 2023
Viaarxiv icon