Picture for Junya Honda

Junya Honda

Multi-Player Approaches for Dueling Bandits

Add code
May 25, 2024
Figure 1 for Multi-Player Approaches for Dueling Bandits
Figure 2 for Multi-Player Approaches for Dueling Bandits
Figure 3 for Multi-Player Approaches for Dueling Bandits
Figure 4 for Multi-Player Approaches for Dueling Bandits
Viaarxiv icon

Learning with Posterior Sampling for Revenue Management under Time-varying Demand

Add code
May 08, 2024
Viaarxiv icon

Adaptive Learning Rate for Follow-the-Regularized-Leader: Competitive Analysis and Best-of-Both-Worlds

Add code
Mar 10, 2024
Viaarxiv icon

Follow-the-Perturbed-Leader with Fréchet-type Tail Distributions: Optimality in Adversarial Bandits and Best-of-Both-Worlds

Add code
Mar 08, 2024
Viaarxiv icon

Exploration by Optimization with Hybrid Regularizers: Logarithmic Regret with Adversarial Robustness in Partial Monitoring

Add code
Feb 13, 2024
Viaarxiv icon

Thompson Exploration with Best Challenger Rule in Best Arm Identification

Add code
Oct 01, 2023
Viaarxiv icon

Stability-penalty-adaptive Follow-the-regularized-leader: Sparsity, Game-dependency, and Best-of-both-worlds

Add code
May 26, 2023
Figure 1 for Stability-penalty-adaptive Follow-the-regularized-leader: Sparsity, Game-dependency, and Best-of-both-worlds
Figure 2 for Stability-penalty-adaptive Follow-the-regularized-leader: Sparsity, Game-dependency, and Best-of-both-worlds
Figure 3 for Stability-penalty-adaptive Follow-the-regularized-leader: Sparsity, Game-dependency, and Best-of-both-worlds
Viaarxiv icon

A General Recipe for the Analysis of Randomized Multi-Armed Bandit Algorithms

Add code
Mar 10, 2023
Viaarxiv icon

Optimality of Thompson Sampling with Noninformative Priors for Pareto Bandits

Add code
Feb 03, 2023
Viaarxiv icon

Best-of-Both-Worlds Algorithms for Partial Monitoring

Add code
Jul 29, 2022
Figure 1 for Best-of-Both-Worlds Algorithms for Partial Monitoring
Viaarxiv icon