Picture for Zhaochen Su

Zhaochen Su

May

XSkill: Continual Learning from Experience and Skills in Multimodal Agents

Add code
Mar 12, 2026
Viaarxiv icon

AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios

Add code
Feb 26, 2026
Viaarxiv icon

Kimi K2.5: Visual Agentic Intelligence

Add code
Feb 02, 2026
Viaarxiv icon

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Add code
Oct 29, 2025
Viaarxiv icon

GRACE: Generative Representation Learning via Contrastive Policy Optimization

Add code
Oct 06, 2025
Viaarxiv icon

Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning

Add code
Jun 04, 2025
Figure 1 for Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning
Figure 2 for Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning
Figure 3 for Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning
Figure 4 for Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning
Viaarxiv icon

AdaCtrl: Towards Adaptive and Controllable Reasoning via Difficulty-Aware Budgeting

Add code
May 24, 2025
Viaarxiv icon

OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning

Add code
May 13, 2025
Figure 1 for OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning
Figure 2 for OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning
Figure 3 for OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning
Figure 4 for OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning
Viaarxiv icon

A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond

Add code
Mar 27, 2025
Viaarxiv icon

Iterative Value Function Optimization for Guided Decoding

Add code
Mar 05, 2025
Viaarxiv icon