Picture for Yanxing Qi

Yanxing Qi

Reinforcement Learning from Statistical Feedback: the Journey from AB Testing to ANT Testing

Add code
Nov 24, 2023
Viaarxiv icon