Shuang Chen

Neural Flow Operators can Approximate any Operator: Abstract Frameworks and Universal Approximations

May 21, 2026
Via arXiv

Checkup2Action: A Multimodal Clinical Check-up Report Dataset for Patient-Oriented Action Card Generation

May 13, 2026
Via arXiv

Towards On-Policy Data Evolution for Visual-Native Multimodal Deep Search Agents

May 11, 2026
Via arXiv

4DThinker: Thinking with 4D Imagery for Dynamic Spatial Understanding

May 07, 2026
Via arXiv

OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents

May 06, 2026
Via arXiv

Diffusion Model as a Generalist Segmentation Learner

Apr 27, 2026
Via arXiv

Motion-Adaptive Multi-Scale Temporal Modelling with Skeleton-Constrained Spatial Graphs for Efficient 3D Human Pose Estimation

Apr 04, 2026
Via arXiv

Revealing the Learning Dynamics of Long-Context Continual Pre-training

Apr 03, 2026
Via arXiv

Agentic-MME: What Agentic Capability Really Brings to Multimodal Intelligence?

Apr 03, 2026
Via arXiv

Unify-Agent: A Unified Multimodal Agent for World-Grounded Image Synthesis

Apr 01, 2026
Via arXiv