Picture for Tao Lin

Tao Lin

RODS: Reward-Driven Online Data Synthesis for Multi-Turn Tool-Use Agents

Add code
Jun 17, 2026
Viaarxiv icon

Do More Agents Help? Controlled and Protocol-Aligned Evaluation of LLM Agent Workflows

Add code
Jun 04, 2026
Viaarxiv icon

Eliciting Complex Spatial Reasoning in MLLMs through Wide-Baseline Matching

Add code
Jun 02, 2026
Viaarxiv icon

Afford-VLA: Action-Aligned Visual Planning via Internalized Affordance

Add code
May 22, 2026
Viaarxiv icon

Evo-Depth: A Lightweight Depth-Enhanced Vision-Language-Action Model

Add code
May 14, 2026
Viaarxiv icon

Focusable Monocular Depth Estimation

Add code
May 12, 2026
Viaarxiv icon

Exploring Spatial Intelligence from a Generative Perspective

Add code
Apr 22, 2026
Viaarxiv icon

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

Add code
Apr 22, 2026
Viaarxiv icon

Self-Adversarial One Step Generation via Condition Shifting

Add code
Apr 14, 2026
Viaarxiv icon

Rethinking UMM Visual Generation: Masked Modeling for Efficient Image-Only Pre-training

Add code
Mar 17, 2026
Viaarxiv icon