Picture for Qiwei Di

Qiwei Di

Unified Convergence Analysis for Score-Based Diffusion Models with Deterministic Samplers

Add code
Oct 18, 2024
Viaarxiv icon

Relative-Translation Invariant Wasserstein Distance

Add code
Sep 04, 2024
Viaarxiv icon

Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback

Add code
Apr 16, 2024
Viaarxiv icon

Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path

Add code
Feb 14, 2024
Viaarxiv icon

Pessimistic Nonlinear Least-Squares Value Iteration for Offline Reinforcement Learning

Add code
Oct 02, 2023
Figure 1 for Pessimistic Nonlinear Least-Squares Value Iteration for Offline Reinforcement Learning
Viaarxiv icon

Variance-Aware Regret Bounds for Stochastic Contextual Dueling Bandits

Add code
Oct 02, 2023
Viaarxiv icon