Picture for Kwang-Sung Jun

Kwang-Sung Jun

Fixing the Loose Brake: Exponential-Tailed Stopping Time in Best Arm Identification

Add code
Nov 04, 2024
Figure 1 for Fixing the Loose Brake: Exponential-Tailed Stopping Time in Best Arm Identification
Figure 2 for Fixing the Loose Brake: Exponential-Tailed Stopping Time in Best Arm Identification
Viaarxiv icon

HAVER: Instance-Dependent Error Bounds for Maximum Mean Estimation and Applications to Q-Learning

Add code
Nov 01, 2024
Viaarxiv icon

Minimum Empirical Divergence for Sub-Gaussian Linear Bandits

Add code
Oct 31, 2024
Figure 1 for Minimum Empirical Divergence for Sub-Gaussian Linear Bandits
Figure 2 for Minimum Empirical Divergence for Sub-Gaussian Linear Bandits
Figure 3 for Minimum Empirical Divergence for Sub-Gaussian Linear Bandits
Figure 4 for Minimum Empirical Divergence for Sub-Gaussian Linear Bandits
Viaarxiv icon

A Unified Confidence Sequence for Generalized Linear Models, with Applications to Bandits

Add code
Jul 19, 2024
Viaarxiv icon

Adaptive Experimentation When You Can't Experiment

Add code
Jun 15, 2024
Viaarxiv icon

Efficient Low-Rank Matrix Estimation, Experimental Design, and Arm-Set-Dependent Low-Rank Bandits

Add code
Feb 17, 2024
Figure 1 for Efficient Low-Rank Matrix Estimation, Experimental Design, and Arm-Set-Dependent Low-Rank Bandits
Figure 2 for Efficient Low-Rank Matrix Estimation, Experimental Design, and Arm-Set-Dependent Low-Rank Bandits
Figure 3 for Efficient Low-Rank Matrix Estimation, Experimental Design, and Arm-Set-Dependent Low-Rank Bandits
Viaarxiv icon

Better-than-KL PAC-Bayes Bounds

Add code
Feb 14, 2024
Viaarxiv icon

Noise-Adaptive Confidence Sets for Linear Bandits and Application to Bayesian Optimization

Add code
Feb 12, 2024
Viaarxiv icon

Graph Sparsifications using Neural Network Assisted Monte Carlo Tree Search

Add code
Nov 17, 2023
Figure 1 for Graph Sparsifications using Neural Network Assisted Monte Carlo Tree Search
Figure 2 for Graph Sparsifications using Neural Network Assisted Monte Carlo Tree Search
Figure 3 for Graph Sparsifications using Neural Network Assisted Monte Carlo Tree Search
Figure 4 for Graph Sparsifications using Neural Network Assisted Monte Carlo Tree Search
Viaarxiv icon

Improved Regret Bounds of (Multinomial) Logistic Bandits via Regret-to-Confidence-Set Conversion

Add code
Oct 28, 2023
Viaarxiv icon