Picture for Haipeng Luo

Haipeng Luo

On Separation Between Best-Iterate, Random-Iterate, and Last-Iterate Convergence of Learning in Games

Add code
Mar 04, 2025
Viaarxiv icon

Instance-Dependent Regret Bounds for Learning Two-Player Zero-Sum Games with Bandit Feedback

Add code
Feb 24, 2025
Viaarxiv icon

Simultaneous Swap Regret Minimization via KL-Calibration

Add code
Feb 23, 2025
Viaarxiv icon

Contextual Linear Bandits with Delay as Payoff

Add code
Feb 20, 2025
Viaarxiv icon

Alternating Regret for Online Convex Optimization

Add code
Feb 18, 2025
Viaarxiv icon

Corrupted Learning Dynamics in Games

Add code
Dec 10, 2024
Viaarxiv icon

Arena Learning: Build Data Flywheel for LLMs Post-training via Simulated Chatbot Arena

Add code
Jul 15, 2024
Viaarxiv icon

Fast Last-Iterate Convergence of Learning in Games Requires Forgetful Algorithms

Add code
Jun 15, 2024
Figure 1 for Fast Last-Iterate Convergence of Learning in Games Requires Forgetful Algorithms
Figure 2 for Fast Last-Iterate Convergence of Learning in Games Requires Forgetful Algorithms
Figure 3 for Fast Last-Iterate Convergence of Learning in Games Requires Forgetful Algorithms
Figure 4 for Fast Last-Iterate Convergence of Learning in Games Requires Forgetful Algorithms
Viaarxiv icon

Provably Efficient Interactive-Grounded Learning with Personalized Reward

Add code
May 31, 2024
Viaarxiv icon

No-Regret Learning for Fair Multi-Agent Social Welfare Optimization

Add code
May 31, 2024
Viaarxiv icon