Xingshan Zeng

ARTIS: Agentic Risk-Aware Test-Time Scaling via Iterative Simulation

Feb 03, 2026

From Verifiable Dot to Reward Chain: Harnessing Verifiable Reference-based Rewards for Reinforcement Learning of Open-ended Generation

Jan 26, 2026

ToolACE-MCP: Generalizing History-Aware Routing from MCP Tools to the Agent Web

Jan 13, 2026

Memory-T1: Reinforcement Learning for Temporal Reasoning in Multi-session Agents

Dec 23, 2025

Fast, Slow, and Tool-augmented Thinking for LLMs: A Review

Aug 17, 2025

Pitfalls of Rule- and Model-based Verifiers -- A Case Study on Mathematical Reasoning

May 28, 2025

Stepwise Reasoning Checkpoint Analysis: A Test Time Scaling Method to Enhance LLMs' Reasoning

May 23, 2025

The Real Barrier to LLM Agent Usability is Agentic ROI

May 23, 2025

ToolACE-DEV: Self-Improving Tool Learning via Decomposition and EVolution

May 12, 2025

Advancing and Benchmarking Personalized Tool Invocation for LLMs

May 07, 2025