Picture for Hao Tang

Hao Tang

PartRAG: Retrieval-Augmented Part-Level 3D Generation and Editing

Add code
Feb 19, 2026
Viaarxiv icon

StereoAdapter-2: Globally Structure-Consistent Underwater Stereo Depth Estimation

Add code
Feb 18, 2026
Viaarxiv icon

MMA: Multimodal Memory Agent

Add code
Feb 18, 2026
Viaarxiv icon

MoRL: Reinforced Reasoning for Unified Motion Understanding and Generation

Add code
Feb 16, 2026
Viaarxiv icon

Code2Worlds: Empowering Coding LLMs for 4D World Generation

Add code
Feb 12, 2026
Viaarxiv icon

Light4D: Training-Free Extreme Viewpoint 4D Video Relighting

Add code
Feb 12, 2026
Viaarxiv icon

GeneralVLA: Generalizable Vision-Language-Action Models with Knowledge-Guided Trajectory Planning

Add code
Feb 04, 2026
Viaarxiv icon

Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models

Add code
Feb 02, 2026
Viaarxiv icon

SayNext-Bench: Why Do LLMs Struggle with Next-Utterance Prediction?

Add code
Jan 30, 2026
Viaarxiv icon

Hallucination Begins Where Saliency Drops

Add code
Jan 28, 2026
Viaarxiv icon