Kaiwen Zheng

LLMPopcorn: An Empirical Study of LLMs as Assistants for Popular Micro-video Generation

Feb 19, 2025
Via arXiv

Elucidating the Preconditioning in Consistency Distillation

Feb 05, 2025
Via arXiv

Visual Generation Without Guidance

Jan 26, 2025
Via arXiv

Efficient and Effective Adaptation of Multimodal Foundation Models in Sequential Recommendation

Nov 05, 2024
Via arXiv

Consistency Diffusion Bridge Models

Oct 31, 2024
Via arXiv

Masked Diffusion Models are Secretly Time-Agnostic Masked Models and Exploit Inaccurate Categorical Sampling

Sep 04, 2024
Via arXiv

Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control

Jul 12, 2024
Via arXiv

Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion Model

Jun 22, 2024
Via arXiv

Diffusion Bridge Implicit Models

May 24, 2024
Via arXiv

Vidu: a Highly Consistent, Dynamic and Skilled Text-to-Video Generator with Diffusion Models

May 07, 2024
Via arXiv