Picture for Zicheng Hu

Zicheng Hu

A Near-optimal, Scalable and Corruption-tolerant Framework for Stochastic Bandits: From Single-Agent to Multi-Agent and Beyond

Add code
Feb 11, 2025
Viaarxiv icon