Xingang Pan

PnP-U3D: Plug-and-Play 3D Framework Bridging Autoregression and Diffusion for Unified Understanding and Generation

Feb 03, 2026

PI-Light: Physics-Inspired Diffusion for Full-Image Relighting

Jan 29, 2026

StoryMem: Multi-shot Long Video Storytelling with Memory

Dec 22, 2025

Trainable Log-linear Sparse Attention for Efficient Diffusion Transformers

Dec 18, 2025

BokehDepth: Enhancing Monocular Depth Estimation through Bokeh Generation

Dec 13, 2025

FastMesh: Efficient Artistic Mesh Generation via Component Decoupling

Aug 27, 2025

STream3R: Scalable Sequential 3D Reconstruction with Causal Transformer

Aug 14, 2025

WORLDMEM: Long-term Consistent World Simulation with Memory

Apr 16, 2025

FreeFlux: Understanding and Exploiting Layer-Specific Roles in RoPE-Based MMDiT for Versatile Image Editing

Mar 20, 2025

Bokeh Diffusion: Defocus Blur Control in Text-to-Image Diffusion Models

Mar 13, 2025