Picture for Kai Zhu

Kai Zhu

Improved Video VAE for Latent Video Diffusion Model

Add code
Nov 10, 2024
Viaarxiv icon

BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations

Add code
Jul 03, 2024
Figure 1 for BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations
Figure 2 for BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations
Figure 3 for BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations
Figure 4 for BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations
Viaarxiv icon

ViViD: Video Virtual Try-on using Diffusion Models

Add code
May 20, 2024
Figure 1 for ViViD: Video Virtual Try-on using Diffusion Models
Figure 2 for ViViD: Video Virtual Try-on using Diffusion Models
Figure 3 for ViViD: Video Virtual Try-on using Diffusion Models
Figure 4 for ViViD: Video Virtual Try-on using Diffusion Models
Viaarxiv icon

InFusion: Inpainting 3D Gaussians via Learning Depth Completion from Diffusion Prior

Add code
Apr 17, 2024
Viaarxiv icon

Bilateral Unsymmetrical Graph Contrastive Learning for Recommendation

Add code
Mar 22, 2024
Figure 1 for Bilateral Unsymmetrical Graph Contrastive Learning for Recommendation
Figure 2 for Bilateral Unsymmetrical Graph Contrastive Learning for Recommendation
Figure 3 for Bilateral Unsymmetrical Graph Contrastive Learning for Recommendation
Figure 4 for Bilateral Unsymmetrical Graph Contrastive Learning for Recommendation
Viaarxiv icon

Intention-driven Ego-to-Exo Video Generation

Add code
Mar 17, 2024
Figure 1 for Intention-driven Ego-to-Exo Video Generation
Figure 2 for Intention-driven Ego-to-Exo Video Generation
Figure 3 for Intention-driven Ego-to-Exo Video Generation
Figure 4 for Intention-driven Ego-to-Exo Video Generation
Viaarxiv icon

CCM: Adding Conditional Controls to Text-to-Image Consistency Models

Add code
Dec 12, 2023
Figure 1 for CCM: Adding Conditional Controls to Text-to-Image Consistency Models
Figure 2 for CCM: Adding Conditional Controls to Text-to-Image Consistency Models
Figure 3 for CCM: Adding Conditional Controls to Text-to-Image Consistency Models
Figure 4 for CCM: Adding Conditional Controls to Text-to-Image Consistency Models
Viaarxiv icon

Likelihood-Aware Semantic Alignment for Full-Spectrum Out-of-Distribution Detection

Add code
Dec 04, 2023
Viaarxiv icon

Background Activation Suppression for Weakly Supervised Object Localization and Semantic Segmentation

Add code
Sep 22, 2023
Viaarxiv icon

Regularized Mask Tuning: Uncovering Hidden Knowledge in Pre-trained Vision-Language Models

Add code
Aug 06, 2023
Viaarxiv icon