Picture for Chunhua Shen

Chunhua Shen

The University of Adelaide

MemDreamer: Decoupling Perception and Reasoning for Long Video Understanding via Hierarchical Graph Memory and Agentic Retrieval Mechanism

Add code
Jun 05, 2026
Viaarxiv icon

HORIZON: Recoverability-Governed Curriculum for Physical-Domain Scaling

Add code
Jun 03, 2026
Viaarxiv icon

Eliciting Complex Spatial Reasoning in MLLMs through Wide-Baseline Matching

Add code
Jun 02, 2026
Viaarxiv icon

Where to Look: Can Foundation Models Reach a Target Viewpoint Through Active Exploration?

Add code
May 31, 2026
Viaarxiv icon

Geo-Align: Video Generation Alignment via Metric Geometry Reward

Add code
May 22, 2026
Viaarxiv icon

MARBLE: Multi-Aspect Reward Balance for Diffusion RL

Add code
May 07, 2026
Viaarxiv icon

Unlocking the Power of Critical Factors for 3D Visual Geometry Estimation

Add code
Apr 23, 2026
Viaarxiv icon

MMControl: Unified Multi-Modal Control for Joint Audio-Video Generation

Add code
Apr 22, 2026
Viaarxiv icon

Exploring Spatial Intelligence from a Generative Perspective

Add code
Apr 22, 2026
Viaarxiv icon

OmniJigsaw: Enhancing Omni-Modal Reasoning via Modality-Orchestrated Reordering

Add code
Apr 09, 2026
Viaarxiv icon