Peizhao Zhang

LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity

Dec 13, 2024

Movie Gen: A Cast of Media Foundation Models

Oct 17, 2024

An Analysis on Quantizing Diffusion Transformers

Jun 16, 2024

FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis

Dec 29, 2023

Efficient Quantization Strategies for Latent Diffusion Models

Dec 09, 2023

ControlRoom3D: Room Generation using Semantic Proxy Rooms

Dec 08, 2023

Cache Me if You Can: Accelerating Diffusion Models through Block Caching

Dec 06, 2023

Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack

Sep 27, 2023

Auto-CARD: Efficient and Robust Codec Avatar Driving for Real-time Mobile Telepresence

Apr 24, 2023

DIME-FM: DIstilling Multimodal and Efficient Foundation Models

Mar 31, 2023