Picture for Shaochong Jia

Shaochong Jia

An Intermediate Fusion ViT Enables Efficient Text-Image Alignment in Diffusion Models

Add code
Mar 25, 2024
Viaarxiv icon

Efficient Multimodal Diffusion Models Using Joint Data Infilling with Partially Shared U-Net

Add code
Nov 28, 2023
Viaarxiv icon