Zhiyuan Yao

FinTrace: Holistic Trajectory-Level Evaluation of LLM Tool Calling for Long-Horizon Financial Tasks

Apr 11, 2026
Via arXiv

SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization

Apr 02, 2026
Via arXiv

CoBA-RL: Capability-Oriented Budget Allocation for Reinforcement Learning in LLMs

Feb 03, 2026
Via arXiv

$V_0$: A Generalist Value Model for Any Policy at State Zero

Feb 03, 2026
Via arXiv

LongCat-Flash-Thinking-2601 Technical Report

Jan 23, 2026
Via arXiv

ToolACE-MCP: Generalizing History-Aware Routing from MCP Tools to the Agent Web

Jan 13, 2026
Via arXiv

RealMem: Benchmarking LLMs in Real-World Memory-Driven Interaction

Jan 11, 2026
Via arXiv

Does Memory Need Graphs? A Unified Framework and Empirical Analysis for Long-Term Dialog Memory

Jan 07, 2026
Via arXiv

Octopus: Agentic Multimodal Reasoning with Six-Capability Orchestration

Nov 19, 2025
Via arXiv

Inverse Knowledge Search over Verifiable Reasoning: Synthesizing a Scientific Encyclopedia from a Long Chains-of-Thought Knowledge Base

Oct 30, 2025
Via arXiv