Picture for Linjian Meng

Linjian Meng

Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Models Alignment

Add code
Oct 22, 2024
Viaarxiv icon

Efficient Last-iterate Convergence Algorithms in Solving Games

Add code
Aug 22, 2023
Viaarxiv icon

Generalized Bandit Regret Minimizer Framework in Imperfect Information Extensive-Form Game

Add code
Mar 28, 2022
Figure 1 for Generalized Bandit Regret Minimizer Framework in Imperfect Information Extensive-Form Game
Figure 2 for Generalized Bandit Regret Minimizer Framework in Imperfect Information Extensive-Form Game
Figure 3 for Generalized Bandit Regret Minimizer Framework in Imperfect Information Extensive-Form Game
Figure 4 for Generalized Bandit Regret Minimizer Framework in Imperfect Information Extensive-Form Game
Viaarxiv icon