Picture for Juanzi Li

Juanzi Li

IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse

Add code
Mar 12, 2026
Viaarxiv icon

GLM-5: from Vibe Coding to Agentic Engineering

Add code
Feb 17, 2026
Viaarxiv icon

WildReward: Learning Reward Models from In-the-Wild Human Interactions

Add code
Feb 09, 2026
Viaarxiv icon

MM-THEBench: Do Reasoning MLLMs Think Reasonably?

Add code
Jan 30, 2026
Viaarxiv icon

On the Paradoxical Interference between Instruction-Following and Task Solving

Add code
Jan 29, 2026
Viaarxiv icon

RPC-Bench: A Fine-grained Benchmark for Research Paper Comprehension

Add code
Jan 14, 2026
Viaarxiv icon

Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

Add code
Jan 09, 2026
Viaarxiv icon

WebSeer: Training Deeper Search Agents through Reinforcement Learning with Self-Reflection

Add code
Oct 21, 2025
Viaarxiv icon

StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?

Add code
Oct 02, 2025
Viaarxiv icon

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Add code
Aug 08, 2025
Viaarxiv icon