Picture for Gang Wu

Gang Wu

Towards Semantic-based Agent Communication Networks: Vision, Technologies, and Challenges

Add code
Mar 25, 2026
Viaarxiv icon

Anticipatory Planning for Multimodal AI Agents

Add code
Mar 17, 2026
Viaarxiv icon

InfinityStory: Unlimited Video Generation with World Consistency and Character-Aware Shot Transitions

Add code
Mar 04, 2026
Viaarxiv icon

Proc3D: Procedural 3D Generation and Parametric Editing of 3D Shapes with Large Language Models

Add code
Jan 18, 2026
Viaarxiv icon

UDPNet: Unleashing Depth-based Priors for Robust Image Dehazing

Add code
Jan 11, 2026
Viaarxiv icon

Atom: Efficient On-Device Video-Language Pipelines Through Modular Reuse

Add code
Dec 18, 2025
Figure 1 for Atom: Efficient On-Device Video-Language Pipelines Through Modular Reuse
Figure 2 for Atom: Efficient On-Device Video-Language Pipelines Through Modular Reuse
Figure 3 for Atom: Efficient On-Device Video-Language Pipelines Through Modular Reuse
Figure 4 for Atom: Efficient On-Device Video-Language Pipelines Through Modular Reuse
Viaarxiv icon

MedGEN-Bench: Contextually entangled benchmark for open-ended multimodal medical generation

Add code
Nov 18, 2025
Viaarxiv icon

Exploring the Global-to-Local Attention Scheme in Graph Transformers: An Empirical Study

Add code
Sep 18, 2025
Viaarxiv icon

A Survey on Long-Video Storytelling Generation: Architectures, Consistency, and Cinematic Quality

Add code
Jul 09, 2025
Figure 1 for A Survey on Long-Video Storytelling Generation: Architectures, Consistency, and Cinematic Quality
Figure 2 for A Survey on Long-Video Storytelling Generation: Architectures, Consistency, and Cinematic Quality
Viaarxiv icon

Boosting All-in-One Image Restoration via Self-Improved Privilege Learning

Add code
May 30, 2025
Figure 1 for Boosting All-in-One Image Restoration via Self-Improved Privilege Learning
Figure 2 for Boosting All-in-One Image Restoration via Self-Improved Privilege Learning
Figure 3 for Boosting All-in-One Image Restoration via Self-Improved Privilege Learning
Figure 4 for Boosting All-in-One Image Restoration via Self-Improved Privilege Learning
Viaarxiv icon