Picture for Hao Tang

Hao Tang

MoRL: Reinforced Reasoning for Unified Motion Understanding and Generation

Add code
Feb 16, 2026
Viaarxiv icon

Light4D: Training-Free Extreme Viewpoint 4D Video Relighting

Add code
Feb 12, 2026
Viaarxiv icon

Code2Worlds: Empowering Coding LLMs for 4D World Generation

Add code
Feb 12, 2026
Viaarxiv icon

GeneralVLA: Generalizable Vision-Language-Action Models with Knowledge-Guided Trajectory Planning

Add code
Feb 04, 2026
Viaarxiv icon

Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models

Add code
Feb 02, 2026
Viaarxiv icon

SayNext-Bench: Why Do LLMs Struggle with Next-Utterance Prediction?

Add code
Jan 30, 2026
Viaarxiv icon

Hallucination Begins Where Saliency Drops

Add code
Jan 28, 2026
Viaarxiv icon

FourierPET: Deep Fourier-based Unrolled Network for Low-count PET Reconstruction

Add code
Jan 16, 2026
Viaarxiv icon

Cross-modal Proxy Evolving for OOD Detection with Vision-Language Models

Add code
Jan 13, 2026
Viaarxiv icon

3D CoCa v2: Contrastive Learners with Test-Time Search for Generalizable Spatial Intelligence

Add code
Jan 10, 2026
Viaarxiv icon