Thang Duong

Improving the Data-efficiency of Reinforcement Learning by Warm-starting with LLM

May 16, 2025
Via arXiv

Non-stationary Bandits and Meta-Learning with a Small Set of Optimal Arms

Mar 13, 2022
Via arXiv