Picture for Yuchen Shi

Yuchen Shi

Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization

Add code
Dec 31, 2025
Viaarxiv icon

SmartSnap: Proactive Evidence Seeking for Self-Verifying Agents

Add code
Dec 26, 2025
Viaarxiv icon

Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning

Add code
Sep 26, 2025
Figure 1 for Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning
Figure 2 for Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning
Figure 3 for Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning
Figure 4 for Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning
Viaarxiv icon

FlowAgent: Achieving Compliance and Flexibility for Workflow Agents

Add code
Feb 20, 2025
Figure 1 for FlowAgent: Achieving Compliance and Flexibility for Workflow Agents
Figure 2 for FlowAgent: Achieving Compliance and Flexibility for Workflow Agents
Figure 3 for FlowAgent: Achieving Compliance and Flexibility for Workflow Agents
Figure 4 for FlowAgent: Achieving Compliance and Flexibility for Workflow Agents
Viaarxiv icon

LUCY: Linguistic Understanding and Control Yielding Early Stage of Her

Add code
Jan 27, 2025
Figure 1 for LUCY: Linguistic Understanding and Control Yielding Early Stage of Her
Figure 2 for LUCY: Linguistic Understanding and Control Yielding Early Stage of Her
Figure 3 for LUCY: Linguistic Understanding and Control Yielding Early Stage of Her
Figure 4 for LUCY: Linguistic Understanding and Control Yielding Early Stage of Her
Viaarxiv icon

Exploring Critical Testing Scenarios for Decision-Making Policies: An LLM Approach

Add code
Dec 09, 2024
Figure 1 for Exploring Critical Testing Scenarios for Decision-Making Policies: An LLM Approach
Figure 2 for Exploring Critical Testing Scenarios for Decision-Making Policies: An LLM Approach
Figure 3 for Exploring Critical Testing Scenarios for Decision-Making Policies: An LLM Approach
Figure 4 for Exploring Critical Testing Scenarios for Decision-Making Policies: An LLM Approach
Viaarxiv icon

Towards Fault Tolerance in Multi-Agent Reinforcement Learning

Add code
Nov 30, 2024
Figure 1 for Towards Fault Tolerance in Multi-Agent Reinforcement Learning
Figure 2 for Towards Fault Tolerance in Multi-Agent Reinforcement Learning
Figure 3 for Towards Fault Tolerance in Multi-Agent Reinforcement Learning
Figure 4 for Towards Fault Tolerance in Multi-Agent Reinforcement Learning
Viaarxiv icon

AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction

Add code
Sep 03, 2024
Figure 1 for AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction
Figure 2 for AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction
Figure 3 for AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction
Figure 4 for AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction
Viaarxiv icon

CoopASD: Cooperative Machine Anomalous Sound Detection with Privacy Concerns

Add code
Aug 27, 2024
Figure 1 for CoopASD: Cooperative Machine Anomalous Sound Detection with Privacy Concerns
Figure 2 for CoopASD: Cooperative Machine Anomalous Sound Detection with Privacy Concerns
Figure 3 for CoopASD: Cooperative Machine Anomalous Sound Detection with Privacy Concerns
Figure 4 for CoopASD: Cooperative Machine Anomalous Sound Detection with Privacy Concerns
Viaarxiv icon

Subgoal-based Hierarchical Reinforcement Learning for Multi-Agent Collaboration

Add code
Aug 21, 2024
Viaarxiv icon