Xiu Li

PRISM: Rethinking Scattered Atmosphere Reconstruction as a Unified Understanding and Generation Model for Real-world Dehazing

Apr 08, 2026
Via arXiv

SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing

Apr 06, 2026
Via arXiv

ViGoR-Bench: How Far Are Visual Generative Models From Zero-Shot Visual Reasoners?

Mar 26, 2026
Via arXiv

Photon: Speedup Volume Understanding with Efficient Multimodal Large Language Models

Mar 26, 2026
Via arXiv

TopoMesh: High-Fidelity Mesh Autoencoding via Topological Unification

Mar 25, 2026
Via arXiv

Identity-Consistent Video Generation under Large Facial-Angle Variations

Mar 22, 2026
Via arXiv

MoKus: Leveraging Cross-Modal Knowledge Transfer for Knowledge-Aware Concept Customization

Mar 13, 2026
Via arXiv

RoboStereo: Dual-Tower 4D Embodied World Models for Unified Policy Optimization

Mar 13, 2026
Via arXiv

InfiniteDance: Scalable 3D Dance Generation Towards in-the-wild Generalization

Mar 10, 2026
Via arXiv

PreciseCache: Precise Feature Caching for Efficient and High-fidelity Video Generation

Mar 03, 2026
Via arXiv