Reward Learning as Doubly Nonparametric Bandits: Optimal Design and Scaling Laws

Feb 23, 2023


View paper on arXiv
