Picture for Kaituo Feng

Kaituo Feng

Uni-Edit: Intelligent Editing Is A General Task For Unified Model Tuning

Add code
May 20, 2026
Viaarxiv icon

From Web to Pixels: Bringing Agentic Search into Visual Perception

Add code
May 12, 2026
Viaarxiv icon

OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents

Add code
May 06, 2026
Viaarxiv icon

The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook

Add code
Apr 02, 2026
Viaarxiv icon

Unify-Agent: A Unified Multimodal Agent for World-Grounded Image Synthesis

Add code
Apr 01, 2026
Viaarxiv icon

Gen-Searcher: Reinforcing Agentic Search for Image Generation

Add code
Mar 30, 2026
Viaarxiv icon

Exploring Reasoning Reward Model for Agents

Add code
Jan 29, 2026
Viaarxiv icon

AdaTooler-V: Adaptive Tool-Use for Images and Videos

Add code
Dec 19, 2025
Figure 1 for AdaTooler-V: Adaptive Tool-Use for Images and Videos
Figure 2 for AdaTooler-V: Adaptive Tool-Use for Images and Videos
Figure 3 for AdaTooler-V: Adaptive Tool-Use for Images and Videos
Figure 4 for AdaTooler-V: Adaptive Tool-Use for Images and Videos
Viaarxiv icon

SpaceVista: All-Scale Visual Spatial Reasoning from mm to km

Add code
Oct 10, 2025
Viaarxiv icon

Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing

Add code
Jun 11, 2025
Figure 1 for Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing
Figure 2 for Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing
Figure 3 for Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing
Figure 4 for Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing
Viaarxiv icon