Picture for Xin Zhou

Xin Zhou

Singapore Management University

When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models

Add code
Apr 09, 2026
Viaarxiv icon

PointTPA: Dynamic Network Parameter Adaptation for 3D Scene Understanding

Add code
Apr 06, 2026
Viaarxiv icon

Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

Add code
Mar 26, 2026
Viaarxiv icon

Learn Hard Problems During RL with Reference Guided Fine-tuning

Add code
Mar 05, 2026
Viaarxiv icon

SSKG Hub: An Expert-Guided Platform for LLM-Empowered Sustainability Standards Knowledge Graphs

Add code
Feb 28, 2026
Viaarxiv icon

NavDreamer: Video Models as Zero-Shot 3D Navigators

Add code
Feb 10, 2026
Viaarxiv icon

CLIP-Map: Structured Matrix Mapping for Parameter-Efficient CLIP Compression

Add code
Feb 05, 2026
Viaarxiv icon

USS-Nav: Unified Spatio-Semantic Scene Graph for Lightweight UAV Zero-Shot Object Navigation

Add code
Feb 03, 2026
Viaarxiv icon

SP^2DPO: An LLM-assisted Semantic Per-Pair DPO Generalization

Add code
Jan 29, 2026
Viaarxiv icon

SLM-SS: Speech Language Model for Generative Speech Separation

Add code
Jan 27, 2026
Viaarxiv icon