Bryan Dai

Large-Scale Terminal Agentic Trajectory Generation from Dockerized Environments

Feb 03, 2026

AGRO-SQL: Agentic Group-Relative Optimization with High-Fidelity Data Synthesis

Dec 29, 2025

M2G-Eval: Enhancing and Evaluating Multi-granularity Multilingual Code Generation

Dec 27, 2025

Context as a Tool: Context Management for Long-Horizon SWE-Agents

Dec 26, 2025

Universal Reasoning Model

Dec 24, 2025

Scaling Laws for Code: Every Programming Language Matters

Dec 15, 2025

Fleming-R1: Toward Expert-Level Medical Reasoning via Reinforcement Learning

Sep 18, 2025

REAL-Prover: Retrieval Augmented Lean Prover for Mathematical Reasoning

May 27, 2025

One-shot Entropy Minimization

May 27, 2025

Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning

Feb 20, 2025