Xuefeng Gao

Reward-Directed Score-Based Diffusion Models via q-Learning

Sep 07, 2024
Via arXiv

Regret Bounds for Episodic Risk-Sensitive Linear Quadratic Regulator

Jun 08, 2024
Via arXiv

Reinforcement Learning for Intensity Control: An Application to Choice-Based Network Revenue Management

Jun 08, 2024
Via arXiv

Reinforcement Learning for Jump-Diffusions

May 26, 2024
Via arXiv

No Algorithmic Collusion in Two-Player Blindfolded Game with Thompson Sampling

May 23, 2024
Via arXiv

Convergence Analysis for General Probability Flow ODEs of Diffusion Models in Wasserstein Distances

Jan 31, 2024
Via arXiv

Wasserstein Convergence Guarantees for a General Class of Score-Based Generative Models

Nov 18, 2023
Via arXiv

Regret Bounds for Markov Decision Processes with Recursive Optimized Certainty Equivalents

Jan 30, 2023
Via arXiv

Square-Root Regret Bounds for Continuous-Time Episodic Markov Decision Processes

Oct 03, 2022
Via arXiv

Logarithmic Regret Bounds for Continuous-Time Average-Reward Markov Decision Processes

May 24, 2022
Via arXiv