Picture for Chi Jin

Chi Jin

The PokeAgent Challenge: Competitive and Long-Context Learning at Scale

Add code
Mar 17, 2026
Viaarxiv icon

Automatic Generation of High-Performance RL Environments

Add code
Mar 12, 2026
Viaarxiv icon

DataFactory: Collaborative Multi-Agent Framework for Advanced Table Question Answering

Add code
Mar 10, 2026
Viaarxiv icon

AlgoVeri: An Aligned Benchmark for Verified Code Generation on Classical Algorithms

Add code
Feb 10, 2026
Viaarxiv icon

MUSIC: MUlti-Step Instruction Contrast for Multi-Turn Reward Models

Add code
Dec 31, 2025
Viaarxiv icon

Recurrent Autoregressive Diffusion: Global Memory Meets Local Attention

Add code
Nov 17, 2025
Figure 1 for Recurrent Autoregressive Diffusion: Global Memory Meets Local Attention
Figure 2 for Recurrent Autoregressive Diffusion: Global Memory Meets Local Attention
Figure 3 for Recurrent Autoregressive Diffusion: Global Memory Meets Local Attention
Figure 4 for Recurrent Autoregressive Diffusion: Global Memory Meets Local Attention
Viaarxiv icon

Frontier LLMs Still Struggle with Simple Reasoning Tasks

Add code
Jul 09, 2025
Figure 1 for Frontier LLMs Still Struggle with Simple Reasoning Tasks
Figure 2 for Frontier LLMs Still Struggle with Simple Reasoning Tasks
Figure 3 for Frontier LLMs Still Struggle with Simple Reasoning Tasks
Figure 4 for Frontier LLMs Still Struggle with Simple Reasoning Tasks
Viaarxiv icon

Principled Out-of-Distribution Generalization via Simplicity

Add code
May 28, 2025
Viaarxiv icon

Learning World Models for Interactive Video Generation

Add code
May 28, 2025
Viaarxiv icon

Ineq-Comp: Benchmarking Human-Intuitive Compositional Reasoning in Automated Theorem Proving on Inequalities

Add code
May 19, 2025
Viaarxiv icon