Picture for Min-hwan Oh

Min-hwan Oh

Seoul National University

Practical and Optimal Algorithm for Linear Contextual Bandits with Rare Parameter Updates

Add code
May 31, 2026
Viaarxiv icon

Nonstationary Generalized Linear Bandits with Discounted Online Mirror Descent

Add code
May 25, 2026
Viaarxiv icon

Latent Representation Alignment for Offline Goal-Conditioned Reinforcement Learning

Add code
May 25, 2026
Viaarxiv icon

Optimal Design for Multinomial Logit Model with Applications to Best Assortment Identification

Add code
May 25, 2026
Viaarxiv icon

Multi-Step Likelihood-Ratio Correction for Reinforcement Learning with Verifiable Rewards

Add code
May 20, 2026
Viaarxiv icon

Block-Sphere Vector Quantization

Add code
May 19, 2026
Viaarxiv icon

Peng's Q($λ$) for Conservative Value Estimation in Offline Reinforcement Learning

Add code
May 14, 2026
Viaarxiv icon

RelFlexformer: Efficient Attention 3D-Transformers for Integrable Relative Positional Encodings

Add code
May 11, 2026
Viaarxiv icon

Unified Framework of Distributional Regret in Multi-Armed Bandits and Reinforcement Learning

Add code
May 06, 2026
Viaarxiv icon

Blessings of Multiple Good Arms in Multi-Objective Linear Bandits

Add code
Feb 13, 2026
Viaarxiv icon