Picture for Jiji Tang

Jiji Tang

StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization

Add code
Dec 10, 2024
Viaarxiv icon

Character-Adapter: Prompt-Guided Region Control for High-Fidelity Character Customization

Add code
Jun 24, 2024
Viaarxiv icon

Let Storytelling Tell Vivid Stories: An Expressive and Fluent Multimodal Storyteller

Add code
Mar 12, 2024
Viaarxiv icon

Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks

Add code
Jan 23, 2024
Figure 1 for Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks
Figure 2 for Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks
Figure 3 for Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks
Figure 4 for Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks
Viaarxiv icon

Beyond First Impressions: Integrating Joint Multi-modal Cues for Comprehensive 3D Representation

Add code
Aug 06, 2023
Viaarxiv icon

Structure-CLIP: Enhance Multi-modal Language Representations with Structure Knowledge

Add code
May 06, 2023
Viaarxiv icon

ERNIE-ViL: Knowledge Enhanced Vision-Language Representations Through Scene Graph

Add code
Jun 30, 2020
Figure 1 for ERNIE-ViL: Knowledge Enhanced Vision-Language Representations Through Scene Graph
Figure 2 for ERNIE-ViL: Knowledge Enhanced Vision-Language Representations Through Scene Graph
Figure 3 for ERNIE-ViL: Knowledge Enhanced Vision-Language Representations Through Scene Graph
Figure 4 for ERNIE-ViL: Knowledge Enhanced Vision-Language Representations Through Scene Graph
Viaarxiv icon