Hanwang Zhang

Pushing Rendering Boundaries: Hard Gaussian Splatting

Dec 06, 2024
Via arXiv

HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editing

Dec 05, 2024
Via arXiv

LoRA of Change: Learning to Generate LoRA for the Editing Instruction from A Single Before-After Image Pair

Nov 28, 2024
Via arXiv

CARE Transformer: Mobile-Friendly Linear Visual Transformer via Decoupled Dual Interaction

Nov 25, 2024
Via arXiv

Ca2-VDM: Efficient Autoregressive Video Diffusion Model with Causal Generation and Cache Sharing

Nov 25, 2024
Via arXiv

AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea

Nov 24, 2024
Via arXiv

Robust Fine-tuning of Zero-shot Models via Variance Reduction

Nov 11, 2024
Via arXiv

Unified Generative and Discriminative Training for Multi-modal Large Language Models

Nov 01, 2024
Via arXiv

Enhancing Zero-Shot Vision Models by Label-Free Prompt Distribution Learning and Bias Correcting

Oct 25, 2024
Via arXiv

Few-shot NeRF by Adaptive Rendering Loss Regularization

Oct 23, 2024
Via arXiv