Hanwang Zhang

Seeing World Dynamics in a Nutshell

Feb 05, 2025

Nautilus: Locality-aware Autoencoder for Scalable Mesh Generation

Jan 27, 2025

Pushing Rendering Boundaries: Hard Gaussian Splatting

Dec 06, 2024

HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editing

Dec 05, 2024

LoRA of Change: Learning to Generate LoRA for the Editing Instruction from A Single Before-After Image Pair

Nov 28, 2024

CARE Transformer: Mobile-Friendly Linear Visual Transformer via Decoupled Dual Interaction

Nov 25, 2024

Ca2-VDM: Efficient Autoregressive Video Diffusion Model with Causal Generation and Cache Sharing

Nov 25, 2024

AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea

Nov 24, 2024

Robust Fine-tuning of Zero-shot Models via Variance Reduction

Nov 11, 2024

Unified Generative and Discriminative Training for Multi-modal Large Language Models

Nov 01, 2024