Picture for Hanwang Zhang

Hanwang Zhang

Parallel Diffusion Solver via Residual Dirichlet Policy Optimization

Add code
Dec 28, 2025
Viaarxiv icon

DEPO: Dual-Efficiency Preference Optimization for LLM Agents

Add code
Nov 19, 2025
Viaarxiv icon

WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation

Add code
Nov 14, 2025
Figure 1 for WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation
Figure 2 for WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation
Figure 3 for WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation
Figure 4 for WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation
Viaarxiv icon

NeuSpring: Neural Spring Fields for Reconstruction and Simulation of Deformable Objects from Videos

Add code
Nov 11, 2025
Viaarxiv icon

Corvid: Improving Multimodal Large Language Models Towards Chain-of-Thought Reasoning

Add code
Jul 10, 2025
Viaarxiv icon

DragNeXt: Rethinking Drag-Based Image Editing

Add code
Jun 09, 2025
Figure 1 for DragNeXt: Rethinking Drag-Based Image Editing
Figure 2 for DragNeXt: Rethinking Drag-Based Image Editing
Figure 3 for DragNeXt: Rethinking Drag-Based Image Editing
Figure 4 for DragNeXt: Rethinking Drag-Based Image Editing
Viaarxiv icon

3D Question Answering via only 2D Vision-Language Models

Add code
May 28, 2025
Viaarxiv icon

Exploring Consciousness in LLMs: A Systematic Survey of Theories, Implementations, and Frontier Risks

Add code
May 26, 2025
Viaarxiv icon

Co-Reinforcement Learning for Unified Multimodal Understanding and Generation

Add code
May 23, 2025
Viaarxiv icon

Personalize Your Gaussian: Consistent 3D Scene Personalization from a Single Image

Add code
May 20, 2025
Viaarxiv icon