Picture for Shuang Chen

Shuang Chen

Motion-Adaptive Multi-Scale Temporal Modelling with Skeleton-Constrained Spatial Graphs for Efficient 3D Human Pose Estimation

Add code
Apr 04, 2026
Viaarxiv icon

Revealing the Learning Dynamics of Long-Context Continual Pre-training

Add code
Apr 03, 2026
Viaarxiv icon

Agentic-MME: What Agentic Capability Really Brings to Multimodal Intelligence?

Add code
Apr 03, 2026
Viaarxiv icon

Unify-Agent: A Unified Multimodal Agent for World-Grounded Image Synthesis

Add code
Apr 01, 2026
Viaarxiv icon

Gen-Searcher: Reinforcing Agentic Search for Image Generation

Add code
Mar 30, 2026
Viaarxiv icon

UniMotion: A Unified Framework for Motion-Text-Vision Understanding and Generation

Add code
Mar 23, 2026
Viaarxiv icon

Supervise-assisted Multi-modality Fusion Diffusion Model for PET Restoration

Add code
Feb 12, 2026
Viaarxiv icon

Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Language Models

Add code
Feb 02, 2026
Viaarxiv icon

Exploring Reasoning Reward Model for Agents

Add code
Jan 29, 2026
Viaarxiv icon

Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models

Add code
Jan 29, 2026
Viaarxiv icon