Zhongjie Duan

VIRAL: Visual In-Context Reasoning via Analogy in Diffusion Transformers

Feb 03, 2026

Spectral Evolution Search: Efficient Inference-Time Scaling for Reward-Aligned Image Generation

Feb 03, 2026

Nexus-Gen: A Unified Model for Image Understanding, Generation, and Editing

Apr 30, 2025

EliGen: Entity-Level Controlled Image Generation with Regional Attention

Jan 02, 2025

ArtAug: Enhancing Text-to-Image Generation through Synthesis-Understanding Interaction

Dec 18, 2024

ExVideo: Extending Video Diffusion Models via Parameter-Efficient Post-Tuning

Jun 20, 2024

Diffutoon: High-Resolution Editable Toon Shading via Diffusion Models

Jan 29, 2024

FastBlend: a Powerful Model-Free Toolkit Making Video Stylization Easier

Nov 15, 2023

Learning Knowledge-Enhanced Contextual Language Representations for Domain Natural Language Understanding

Nov 12, 2023

PAI-Diffusion: Constructing and Serving a Family of Open Chinese Diffusion Models for Text-to-image Synthesis on the Cloud

Sep 11, 2023