Picture for Chi Jin

Chi Jin

Building Math Agents with Multi-Turn Iterative Preference Learning

Add code
Sep 04, 2024
Figure 1 for Building Math Agents with Multi-Turn Iterative Preference Learning
Figure 2 for Building Math Agents with Multi-Turn Iterative Preference Learning
Figure 3 for Building Math Agents with Multi-Turn Iterative Preference Learning
Figure 4 for Building Math Agents with Multi-Turn Iterative Preference Learning
Viaarxiv icon

Two-Timescale Gradient Descent Ascent Algorithms for Nonconvex Minimax Optimization

Add code
Aug 21, 2024
Viaarxiv icon

Towards Principled Superhuman AI for Multiplayer Symmetric Games

Add code
Jun 06, 2024
Viaarxiv icon

On Limitation of Transformer for Learning HMMs

Add code
Jun 06, 2024
Viaarxiv icon

FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning

Add code
Jun 04, 2024
Viaarxiv icon

Tuning-Free Stochastic Optimization

Add code
Feb 12, 2024
Viaarxiv icon

Maximum Likelihood Estimation is All You Need for Well-Specified Covariate Shift

Add code
Nov 27, 2023
Viaarxiv icon

ZeroSwap: Data-driven Optimal Market Making in DeFi

Add code
Oct 13, 2023
Viaarxiv icon

Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning

Add code
Sep 29, 2023
Viaarxiv icon

Is RLHF More Difficult than Standard RL?

Add code
Jun 25, 2023
Viaarxiv icon