Picture for Seungki Min

Seungki Min

Improving Thompson Sampling via Information Relaxation for Budgeted Multi-armed Bandits

Add code
Aug 28, 2024
Viaarxiv icon

An Information-Theoretic Analysis of Nonstationary Bandit Learning

Add code
Feb 09, 2023
Viaarxiv icon

Policy Gradient Optimization of Thompson Sampling Policies

Add code
Jun 30, 2020
Figure 1 for Policy Gradient Optimization of Thompson Sampling Policies
Figure 2 for Policy Gradient Optimization of Thompson Sampling Policies
Figure 3 for Policy Gradient Optimization of Thompson Sampling Policies
Figure 4 for Policy Gradient Optimization of Thompson Sampling Policies
Viaarxiv icon

Thompson Sampling with Information Relaxation Penalties

Add code
Feb 12, 2019
Figure 1 for Thompson Sampling with Information Relaxation Penalties
Figure 2 for Thompson Sampling with Information Relaxation Penalties
Figure 3 for Thompson Sampling with Information Relaxation Penalties
Figure 4 for Thompson Sampling with Information Relaxation Penalties
Viaarxiv icon