Picture for Junjie Shentu

Junjie Shentu

Everything is a Video: Unifying Modalities through Next-Frame Prediction

Add code
Nov 15, 2024
Viaarxiv icon

AttenCraft: Attention-guided Disentanglement of Multiple Concepts for Text-to-Image Customization

Add code
May 28, 2024
Figure 1 for AttenCraft: Attention-guided Disentanglement of Multiple Concepts for Text-to-Image Customization
Figure 2 for AttenCraft: Attention-guided Disentanglement of Multiple Concepts for Text-to-Image Customization
Figure 3 for AttenCraft: Attention-guided Disentanglement of Multiple Concepts for Text-to-Image Customization
Figure 4 for AttenCraft: Attention-guided Disentanglement of Multiple Concepts for Text-to-Image Customization
Viaarxiv icon

Textual Localization: Decomposing Multi-concept Images for Subject-Driven Text-to-Image Generation

Add code
Feb 15, 2024
Figure 1 for Textual Localization: Decomposing Multi-concept Images for Subject-Driven Text-to-Image Generation
Figure 2 for Textual Localization: Decomposing Multi-concept Images for Subject-Driven Text-to-Image Generation
Figure 3 for Textual Localization: Decomposing Multi-concept Images for Subject-Driven Text-to-Image Generation
Figure 4 for Textual Localization: Decomposing Multi-concept Images for Subject-Driven Text-to-Image Generation
Viaarxiv icon