Picture for Xiaoyu Tan

Xiaoyu Tan

INF Technology

PRISM: Festina Lente Proactivity -- Risk-Sensitive, Uncertainty-Aware Deliberation for Proactive Agents

Add code
Feb 02, 2026
Viaarxiv icon

Youtu-VL: Unleashing Visual Potential via Unified Vision-Language Supervision

Add code
Jan 27, 2026
Viaarxiv icon

Curiosity Driven Knowledge Retrieval for Mobile Agents

Add code
Jan 27, 2026
Viaarxiv icon

SRU-Pix2Pix: A Fusion-Driven Generator Network for Medical Image Translation with Few-Shot Learning

Add code
Jan 08, 2026
Viaarxiv icon

Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models

Add code
Dec 31, 2025
Viaarxiv icon

Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization

Add code
Dec 31, 2025
Viaarxiv icon

SmartSnap: Proactive Evidence Seeking for Self-Verifying Agents

Add code
Dec 26, 2025
Viaarxiv icon

Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning

Add code
Sep 26, 2025
Figure 1 for Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning
Figure 2 for Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning
Figure 3 for Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning
Figure 4 for Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning
Viaarxiv icon

The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward

Add code
Sep 09, 2025
Figure 1 for The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward
Figure 2 for The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward
Figure 3 for The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward
Figure 4 for The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward
Viaarxiv icon

Guess What I am Thinking: A Benchmark for Inner Thought Reasoning of Role-Playing Language Agents

Add code
Mar 11, 2025
Figure 1 for Guess What I am Thinking: A Benchmark for Inner Thought Reasoning of Role-Playing Language Agents
Figure 2 for Guess What I am Thinking: A Benchmark for Inner Thought Reasoning of Role-Playing Language Agents
Figure 3 for Guess What I am Thinking: A Benchmark for Inner Thought Reasoning of Role-Playing Language Agents
Figure 4 for Guess What I am Thinking: A Benchmark for Inner Thought Reasoning of Role-Playing Language Agents
Viaarxiv icon