Qirong Peng

X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distillation

Mar 08, 2025
Via arXiv