Learning in structured MDPs with convex cost functions: Improved regret bounds for inventory management

Add code
May 10, 2019
Figure 1 for Learning in structured MDPs with convex cost functions: Improved regret bounds for inventory management

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: