Picture for Hirota Kinoshita

Hirota Kinoshita

A Relative-Budget Theory for Reinforcement Learning with Verifiable Rewards in Large Language Model Reasoning

Add code
Feb 02, 2026
Viaarxiv icon

Mechanism design with multi-armed bandit

Add code
Nov 30, 2024
Figure 1 for Mechanism design with multi-armed bandit
Figure 2 for Mechanism design with multi-armed bandit
Figure 3 for Mechanism design with multi-armed bandit
Figure 4 for Mechanism design with multi-armed bandit
Viaarxiv icon