Picture for Xiaoliang Dai

Xiaoliang Dai

LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity

Add code
Dec 13, 2024
Viaarxiv icon

Unleashing In-context Learning of Autoregressive Models for Few-shot Image Manipulation

Add code
Dec 03, 2024
Viaarxiv icon

Towards Automated Model Design on Recommender Systems

Add code
Nov 12, 2024
Viaarxiv icon

Movie Gen: A Cast of Media Foundation Models

Add code
Oct 17, 2024
Figure 1 for Movie Gen: A Cast of Media Foundation Models
Figure 2 for Movie Gen: A Cast of Media Foundation Models
Figure 3 for Movie Gen: A Cast of Media Foundation Models
Figure 4 for Movie Gen: A Cast of Media Foundation Models
Viaarxiv icon

An Analysis on Quantizing Diffusion Transformers

Add code
Jun 16, 2024
Viaarxiv icon

SceneTextGen: Layout-Agnostic Scene Text Image Synthesis with Diffusion Models

Add code
Jun 03, 2024
Figure 1 for SceneTextGen: Layout-Agnostic Scene Text Image Synthesis with Diffusion Models
Figure 2 for SceneTextGen: Layout-Agnostic Scene Text Image Synthesis with Diffusion Models
Figure 3 for SceneTextGen: Layout-Agnostic Scene Text Image Synthesis with Diffusion Models
Figure 4 for SceneTextGen: Layout-Agnostic Scene Text Image Synthesis with Diffusion Models
Viaarxiv icon

Efficient Quantization Strategies for Latent Diffusion Models

Add code
Dec 09, 2023
Figure 1 for Efficient Quantization Strategies for Latent Diffusion Models
Figure 2 for Efficient Quantization Strategies for Latent Diffusion Models
Figure 3 for Efficient Quantization Strategies for Latent Diffusion Models
Figure 4 for Efficient Quantization Strategies for Latent Diffusion Models
Viaarxiv icon

LEGO: Learning EGOcentric Action Frame Generation via Visual Instruction Tuning

Add code
Dec 06, 2023
Figure 1 for LEGO: Learning EGOcentric Action Frame Generation via Visual Instruction Tuning
Figure 2 for LEGO: Learning EGOcentric Action Frame Generation via Visual Instruction Tuning
Figure 3 for LEGO: Learning EGOcentric Action Frame Generation via Visual Instruction Tuning
Figure 4 for LEGO: Learning EGOcentric Action Frame Generation via Visual Instruction Tuning
Viaarxiv icon

Cache Me if You Can: Accelerating Diffusion Models through Block Caching

Add code
Dec 06, 2023
Figure 1 for Cache Me if You Can: Accelerating Diffusion Models through Block Caching
Figure 2 for Cache Me if You Can: Accelerating Diffusion Models through Block Caching
Figure 3 for Cache Me if You Can: Accelerating Diffusion Models through Block Caching
Figure 4 for Cache Me if You Can: Accelerating Diffusion Models through Block Caching
Viaarxiv icon

EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything

Add code
Dec 01, 2023
Viaarxiv icon