Hengshuang Zhao

FocalClick-XL: Towards Unified and High-quality Interactive Segmentation

Jun 17, 2025
Via arXiv

TGDPO: Harnessing Token-Level Reward Guidance for Enhancing Direct Preference Optimization

Jun 17, 2025
Via arXiv

PlayerOne: Egocentric World Simulator

Jun 11, 2025
Via arXiv

LayerFlow: A Unified Model for Layer-aware Video Generation

Jun 04, 2025
Via arXiv

GenSpace: Benchmarking Spatially-Aware Image Generation

May 30, 2025
Via arXiv

Depth Anything with Any Prior

May 15, 2025
Via arXiv

ILLUME+: Illuminating Unified MLLM with Dual Visual Tokenization and Diffusion Refinement

Apr 03, 2025
Via arXiv

Empowering Large Language Models with 3D Situation Awareness

Mar 29, 2025
Via arXiv

Sonata: Self-Supervised Learning of Reliable Point Representations

Mar 20, 2025
Via arXiv

Modular Customization of Diffusion Models via Blockwise-Parameterized Low-Rank Adaptation

Mar 11, 2025
Via arXiv