Picture for Kaito Ariu

Kaito Ariu

Last Iterate Convergence in Monotone Mean Field Games

Add code
Oct 07, 2024
Viaarxiv icon

Matroid Semi-Bandits in Sublinear Time

Add code
May 28, 2024
Viaarxiv icon

Filtered Direct Preference Optimization

Add code
Apr 23, 2024
Figure 1 for Filtered Direct Preference Optimization
Figure 2 for Filtered Direct Preference Optimization
Figure 3 for Filtered Direct Preference Optimization
Figure 4 for Filtered Direct Preference Optimization
Viaarxiv icon

Regularized Best-of-N Sampling to Mitigate Reward Hacking for Language Model Alignment

Add code
Apr 05, 2024
Viaarxiv icon

Return-Aligned Decision Transformer

Add code
Feb 06, 2024
Viaarxiv icon

Hyperparameter-Free Approach for Faster Minimum Bayes Risk Decoding

Add code
Jan 05, 2024
Viaarxiv icon

Model-Based Minimum Bayes Risk Decoding

Add code
Nov 09, 2023
Viaarxiv icon

On Uniformly Optimal Algorithms for Best Arm Identification in Two-Armed Bandits with Fixed Budget

Add code
Sep 04, 2023
Viaarxiv icon

Instance-Optimal Cluster Recovery in the Labeled Stochastic Block Model

Add code
Jun 18, 2023
Viaarxiv icon

A Slingshot Approach to Learning in Monotone Games

Add code
May 26, 2023
Viaarxiv icon