Xihui Liu

Ground Slow, Move Fast: A Dual-System Foundation Model for Generalizable Vision-and-Language Navigation

Dec 09, 2025
Via arXiv

Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

Dec 09, 2025
Via arXiv

SJD++: Improved Speculative Jacobi Decoding for Training-free Acceleration of Discrete Auto-regressive Text-to-Image Generation

Dec 08, 2025
Via arXiv

OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes

Oct 30, 2025
Via arXiv

SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models

Oct 14, 2025
Via arXiv

DSPv2: Improved Dense Policy for Effective and Generalizable Whole-body Mobile Manipulation

Sep 19, 2025
Via arXiv

Understand Before You Generate: Self-Guided Training for Autoregressive Image Generation

Sep 18, 2025
Via arXiv

FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehensive Benchmark

Sep 11, 2025
Via arXiv

ChemBOMAS: Accelerated BO in Chemistry with LLM-Enhanced Multi-Agent System

Sep 10, 2025
Via arXiv

T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation

Aug 24, 2025
Via arXiv