Shuo Zhang

ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development

Jan 16, 2026
Via arXiv

EvoFSM: Controllable Self-Evolution for Deep Research with Finite State Machines

Jan 14, 2026
Via arXiv

MemGovern: Enhancing Code Agents through Learning from Governed Human Experiences

Jan 13, 2026
Via arXiv

ToolACE-MCP: Generalizing History-Aware Routing from MCP Tools to the Agent Web

Jan 13, 2026
Via arXiv

Controlled Self-Evolution for Algorithmic Code Optimization

Jan 13, 2026
Via arXiv

Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning

Jan 11, 2026
Via arXiv

KnowMe-Bench: Benchmarking Person Understanding for Lifelong Digital Companions

Jan 08, 2026
Via arXiv

Sphinx: Benchmarking and Modeling for LLM-Driven Pull Request Review

Jan 06, 2026
Via arXiv

The Empowerment of Science of Science by Large Language Models: New Tools and Methods

Nov 19, 2025
Via arXiv

Synergizing Multigrid Algorithms with Vision Transformer: A Novel Approach to Enhance the Seismic Foundation Model

Nov 17, 2025
Via arXiv