Jianzong Wu

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Dec 10, 2024
Via arXiv

RelationBooth: Towards Relation-Aware Customized Object Generation

Oct 30, 2024
Via arXiv

Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language

Jun 28, 2024
Via arXiv

MotionBooth: Motion-Aware Customized Text-to-Video Generation

Jun 25, 2024
Via arXiv

Towards Language-Driven Video Inpainting via Multimodal Large Language Models

Jan 18, 2024
Via arXiv

Towards Open Vocabulary Learning: A Survey

Jul 06, 2023
Via arXiv

Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation

Jan 02, 2023
Via arXiv

Towards Robust Referring Image Segmentation

Sep 20, 2022
Via arXiv