Chen-Yu Wei

Beating Adversarial Low-Rank MDPs with Unknown Transition and Bandit Feedback

Nov 11, 2024

How Does Variance Shape the Regret in Contextual Bandits?

Oct 16, 2024

Corruption-Robust Linear Bandits: Minimax Optimality and Gap-Dependent Misspecification

Oct 10, 2024

Offline Reinforcement Learning: Role of State Aggregation and Trajectory Data

Mar 25, 2024

Tractable Local Equilibria in Non-Concave Games

Mar 13, 2024

Near-Optimal Policy Optimization for Correlated Equilibrium in General-Sum Markov Games

Jan 26, 2024

Towards Optimal Regret in Adversarial Linear MDPs with Bandit Feedback

Oct 17, 2023

Bypassing the Simulator: Near-Optimal Adversarial Linear Contextual Bandits

Sep 02, 2023

Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs

Jun 20, 2023

No-Regret Online Reinforcement Learning with Adversarial Losses and Transitions

May 30, 2023