Picture for Seungki Min

Seungki Min

Seoul National University

On the Optimality of Tracking Fisher Information in Adaptive Testing with Stochastic Binary Responses

Add code
Oct 09, 2025
Viaarxiv icon

Improving Thompson Sampling via Information Relaxation for Budgeted Multi-armed Bandits

Add code
Aug 28, 2024
Viaarxiv icon

An Information-Theoretic Analysis of Nonstationary Bandit Learning

Add code
Feb 09, 2023
Viaarxiv icon

Policy Gradient Optimization of Thompson Sampling Policies

Add code
Jun 30, 2020
Figure 1 for Policy Gradient Optimization of Thompson Sampling Policies
Figure 2 for Policy Gradient Optimization of Thompson Sampling Policies
Figure 3 for Policy Gradient Optimization of Thompson Sampling Policies
Figure 4 for Policy Gradient Optimization of Thompson Sampling Policies
Viaarxiv icon

Thompson Sampling with Information Relaxation Penalties

Add code
Feb 12, 2019
Figure 1 for Thompson Sampling with Information Relaxation Penalties
Figure 2 for Thompson Sampling with Information Relaxation Penalties
Figure 3 for Thompson Sampling with Information Relaxation Penalties
Figure 4 for Thompson Sampling with Information Relaxation Penalties
Viaarxiv icon