Picture for Chen-Yu Wei

Chen-Yu Wei

Decision Making in Hybrid Environments: A Model Aggregation Approach

Add code
Feb 09, 2025
Viaarxiv icon

Beating Adversarial Low-Rank MDPs with Unknown Transition and Bandit Feedback

Add code
Nov 11, 2024
Figure 1 for Beating Adversarial Low-Rank MDPs with Unknown Transition and Bandit Feedback
Viaarxiv icon

How Does Variance Shape the Regret in Contextual Bandits?

Add code
Oct 16, 2024
Viaarxiv icon

Corruption-Robust Linear Bandits: Minimax Optimality and Gap-Dependent Misspecification

Add code
Oct 10, 2024
Figure 1 for Corruption-Robust Linear Bandits: Minimax Optimality and Gap-Dependent Misspecification
Figure 2 for Corruption-Robust Linear Bandits: Minimax Optimality and Gap-Dependent Misspecification
Figure 3 for Corruption-Robust Linear Bandits: Minimax Optimality and Gap-Dependent Misspecification
Viaarxiv icon

Offline Reinforcement Learning: Role of State Aggregation and Trajectory Data

Add code
Mar 25, 2024
Figure 1 for Offline Reinforcement Learning: Role of State Aggregation and Trajectory Data
Figure 2 for Offline Reinforcement Learning: Role of State Aggregation and Trajectory Data
Figure 3 for Offline Reinforcement Learning: Role of State Aggregation and Trajectory Data
Figure 4 for Offline Reinforcement Learning: Role of State Aggregation and Trajectory Data
Viaarxiv icon

Tractable Local Equilibria in Non-Concave Games

Add code
Mar 13, 2024
Figure 1 for Tractable Local Equilibria in Non-Concave Games
Figure 2 for Tractable Local Equilibria in Non-Concave Games
Figure 3 for Tractable Local Equilibria in Non-Concave Games
Figure 4 for Tractable Local Equilibria in Non-Concave Games
Viaarxiv icon

Near-Optimal Policy Optimization for Correlated Equilibrium in General-Sum Markov Games

Add code
Jan 26, 2024
Viaarxiv icon

Towards Optimal Regret in Adversarial Linear MDPs with Bandit Feedback

Add code
Oct 17, 2023
Viaarxiv icon

Bypassing the Simulator: Near-Optimal Adversarial Linear Contextual Bandits

Add code
Sep 02, 2023
Viaarxiv icon

Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs

Add code
Jun 20, 2023
Viaarxiv icon