Picture for Randy Jia

Randy Jia

Learning an Inventory Control Policy with General Inventory Arrival Dynamics

Add code
Oct 26, 2023
Viaarxiv icon

Contextual Bandits for Evaluating and Improving Inventory Control Policies

Add code
Oct 24, 2023
Figure 1 for Contextual Bandits for Evaluating and Improving Inventory Control Policies
Viaarxiv icon

Linear Reinforcement Learning with Ball Structure Action Space

Add code
Nov 14, 2022
Viaarxiv icon

Learning in structured MDPs with convex cost functions: Improved regret bounds for inventory management

Add code
May 10, 2019
Figure 1 for Learning in structured MDPs with convex cost functions: Improved regret bounds for inventory management
Viaarxiv icon

Posterior sampling for reinforcement learning: worst-case regret bounds

Add code
May 19, 2017
Viaarxiv icon