Picture for Mengqi Huang

Mengqi Huang

Lance: Unified Multimodal Modeling by Multi-Task Synergy

Add code
May 20, 2026
Viaarxiv icon

Stream-T1: Test-Time Scaling for Streaming Video Generation

Add code
May 06, 2026
Viaarxiv icon

Stream-R1: Reliability-Perplexity Aware Reward Distillation for Streaming Video Generation

Add code
May 05, 2026
Viaarxiv icon

NativeTok: Native Visual Tokenization for Improved Image Generation

Add code
Jan 30, 2026
Viaarxiv icon

LayerEdit: Disentangled Multi-Object Editing via Conflict-Aware Multi-Layer Learning

Add code
Nov 11, 2025
Figure 1 for LayerEdit: Disentangled Multi-Object Editing via Conflict-Aware Multi-Layer Learning
Figure 2 for LayerEdit: Disentangled Multi-Object Editing via Conflict-Aware Multi-Layer Learning
Figure 3 for LayerEdit: Disentangled Multi-Object Editing via Conflict-Aware Multi-Layer Learning
Figure 4 for LayerEdit: Disentangled Multi-Object Editing via Conflict-Aware Multi-Layer Learning
Viaarxiv icon

USO: Unified Style and Subject-Driven Generation via Disentangled and Reward Learning

Add code
Aug 26, 2025
Viaarxiv icon

LongAnimation: Long Animation Generation with Dynamic Global-Local Memory

Add code
Jul 02, 2025
Figure 1 for LongAnimation: Long Animation Generation with Dynamic Global-Local Memory
Figure 2 for LongAnimation: Long Animation Generation with Dynamic Global-Local Memory
Figure 3 for LongAnimation: Long Animation Generation with Dynamic Global-Local Memory
Figure 4 for LongAnimation: Long Animation Generation with Dynamic Global-Local Memory
Viaarxiv icon

HDGlyph: A Hierarchical Disentangled Glyph-Based Framework for Long-Tail Text Rendering in Diffusion Models

Add code
May 10, 2025
Figure 1 for HDGlyph: A Hierarchical Disentangled Glyph-Based Framework for Long-Tail Text Rendering in Diffusion Models
Figure 2 for HDGlyph: A Hierarchical Disentangled Glyph-Based Framework for Long-Tail Text Rendering in Diffusion Models
Figure 3 for HDGlyph: A Hierarchical Disentangled Glyph-Based Framework for Long-Tail Text Rendering in Diffusion Models
Figure 4 for HDGlyph: A Hierarchical Disentangled Glyph-Based Framework for Long-Tail Text Rendering in Diffusion Models
Viaarxiv icon

DualReal: Adaptive Joint Training for Lossless Identity-Motion Fusion in Video Customization

Add code
May 04, 2025
Viaarxiv icon

D$^2$iT: Dynamic Diffusion Transformer for Accurate Image Generation

Add code
Apr 13, 2025
Figure 1 for D$^2$iT: Dynamic Diffusion Transformer for Accurate Image Generation
Figure 2 for D$^2$iT: Dynamic Diffusion Transformer for Accurate Image Generation
Figure 3 for D$^2$iT: Dynamic Diffusion Transformer for Accurate Image Generation
Figure 4 for D$^2$iT: Dynamic Diffusion Transformer for Accurate Image Generation
Viaarxiv icon