Picture for Qiaomin Xie

Qiaomin Xie

Two-Timescale Linear Stochastic Approximation: Constant Stepsizes Go a Long Way

Add code
Oct 16, 2024
Viaarxiv icon

Stable Offline Value Function Learning with Bisimulation-based Representations

Add code
Oct 02, 2024
Viaarxiv icon

Inception: Efficiently Computable Misinformation Attacks on Markov Games

Add code
Jun 24, 2024
Figure 1 for Inception: Efficiently Computable Misinformation Attacks on Markov Games
Figure 2 for Inception: Efficiently Computable Misinformation Attacks on Markov Games
Viaarxiv icon

Roping in Uncertainty: Robustness and Regularization in Markov Games

Add code
Jun 13, 2024
Viaarxiv icon

Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning

Add code
Jun 07, 2024
Viaarxiv icon

When is exponential asymptotic optimality achievable in average-reward restless bandits?

Add code
May 28, 2024
Viaarxiv icon

The Collusion of Memory and Nonlinearity in Stochastic Approximation With Constant Stepsize

Add code
May 27, 2024
Viaarxiv icon

Prelimit Coupling and Steady-State Convergence of Constant-stepsize Nonsmooth Contractive SA

Add code
Apr 09, 2024
Viaarxiv icon

Unichain and Aperiodicity are Sufficient for Asymptotic Optimality of Average-Reward Restless Bandits

Add code
Feb 08, 2024
Viaarxiv icon

Constant Stepsize Q-learning: Distributional Convergence, Bias and Extrapolation

Add code
Jan 25, 2024
Viaarxiv icon