Picture for Tanut Treetanthiploet

Tanut Treetanthiploet

$ε$-Policy Gradient for Online Pricing

Add code
May 06, 2024
Viaarxiv icon

Insurance pricing on price comparison websites via reinforcement learning

Add code
Aug 14, 2023
Figure 1 for Insurance pricing on price comparison websites via reinforcement learning
Figure 2 for Insurance pricing on price comparison websites via reinforcement learning
Viaarxiv icon

Optimal scheduling of entropy regulariser for continuous-time linear-quadratic reinforcement learning

Add code
Aug 11, 2022
Viaarxiv icon

Exploration-exploitation trade-off for continuous-time episodic reinforcement learning with linear-convex models

Add code
Dec 19, 2021
Viaarxiv icon

Correlated Bandits for Dynamic Pricing via the ARC algorithm

Add code
Feb 08, 2021
Figure 1 for Correlated Bandits for Dynamic Pricing via the ARC algorithm
Figure 2 for Correlated Bandits for Dynamic Pricing via the ARC algorithm
Figure 3 for Correlated Bandits for Dynamic Pricing via the ARC algorithm
Figure 4 for Correlated Bandits for Dynamic Pricing via the ARC algorithm
Viaarxiv icon

Asymptotic Randomised Control with applications to bandits

Add code
Oct 14, 2020
Figure 1 for Asymptotic Randomised Control with applications to bandits
Figure 2 for Asymptotic Randomised Control with applications to bandits
Figure 3 for Asymptotic Randomised Control with applications to bandits
Figure 4 for Asymptotic Randomised Control with applications to bandits
Viaarxiv icon