Picture for Kai Zhu

Kai Zhu

VanGogh: A Unified Multimodal Diffusion-based Framework for Video Colorization

Add code
Jan 16, 2025
Viaarxiv icon

MangaNinja: Line Art Colorization with Precise Reference Following

Add code
Jan 14, 2025
Viaarxiv icon

DepthLab: From Partial to Complete

Add code
Dec 24, 2024
Viaarxiv icon

Improved Video VAE for Latent Video Diffusion Model

Add code
Nov 10, 2024
Figure 1 for Improved Video VAE for Latent Video Diffusion Model
Figure 2 for Improved Video VAE for Latent Video Diffusion Model
Figure 3 for Improved Video VAE for Latent Video Diffusion Model
Figure 4 for Improved Video VAE for Latent Video Diffusion Model
Viaarxiv icon

BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations

Add code
Jul 03, 2024
Figure 1 for BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations
Figure 2 for BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations
Figure 3 for BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations
Figure 4 for BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations
Viaarxiv icon

ViViD: Video Virtual Try-on using Diffusion Models

Add code
May 20, 2024
Figure 1 for ViViD: Video Virtual Try-on using Diffusion Models
Figure 2 for ViViD: Video Virtual Try-on using Diffusion Models
Figure 3 for ViViD: Video Virtual Try-on using Diffusion Models
Figure 4 for ViViD: Video Virtual Try-on using Diffusion Models
Viaarxiv icon

InFusion: Inpainting 3D Gaussians via Learning Depth Completion from Diffusion Prior

Add code
Apr 17, 2024
Figure 1 for InFusion: Inpainting 3D Gaussians via Learning Depth Completion from Diffusion Prior
Figure 2 for InFusion: Inpainting 3D Gaussians via Learning Depth Completion from Diffusion Prior
Figure 3 for InFusion: Inpainting 3D Gaussians via Learning Depth Completion from Diffusion Prior
Figure 4 for InFusion: Inpainting 3D Gaussians via Learning Depth Completion from Diffusion Prior
Viaarxiv icon

Bilateral Unsymmetrical Graph Contrastive Learning for Recommendation

Add code
Mar 22, 2024
Figure 1 for Bilateral Unsymmetrical Graph Contrastive Learning for Recommendation
Figure 2 for Bilateral Unsymmetrical Graph Contrastive Learning for Recommendation
Figure 3 for Bilateral Unsymmetrical Graph Contrastive Learning for Recommendation
Figure 4 for Bilateral Unsymmetrical Graph Contrastive Learning for Recommendation
Viaarxiv icon

Intention-driven Ego-to-Exo Video Generation

Add code
Mar 17, 2024
Figure 1 for Intention-driven Ego-to-Exo Video Generation
Figure 2 for Intention-driven Ego-to-Exo Video Generation
Figure 3 for Intention-driven Ego-to-Exo Video Generation
Figure 4 for Intention-driven Ego-to-Exo Video Generation
Viaarxiv icon

CCM: Adding Conditional Controls to Text-to-Image Consistency Models

Add code
Dec 12, 2023
Figure 1 for CCM: Adding Conditional Controls to Text-to-Image Consistency Models
Figure 2 for CCM: Adding Conditional Controls to Text-to-Image Consistency Models
Figure 3 for CCM: Adding Conditional Controls to Text-to-Image Consistency Models
Figure 4 for CCM: Adding Conditional Controls to Text-to-Image Consistency Models
Viaarxiv icon