Picture for Yi Xin

Yi Xin

Prism: Efficient Test-Time Scaling via Hierarchical Search and Self-Verification for Discrete Diffusion Language Models

Add code
Feb 02, 2026
Viaarxiv icon

UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture

Add code
Dec 25, 2025
Figure 1 for UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture
Figure 2 for UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture
Figure 3 for UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture
Figure 4 for UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture
Viaarxiv icon

dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models

Add code
Dec 22, 2025
Figure 1 for dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models
Figure 2 for dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models
Figure 3 for dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models
Figure 4 for dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models
Viaarxiv icon

From Masks to Worlds: A Hitchhiker's Guide to World Models

Add code
Oct 23, 2025
Viaarxiv icon

LayerT2V: Interactive Multi-Object Trajectory Layering for Video Generation

Add code
Aug 06, 2025
Viaarxiv icon

TR-PTS: Task-Relevant Parameter and Token Selection for Efficient Tuning

Add code
Jul 30, 2025
Viaarxiv icon

Low-Cost Test-Time Adaptation for Robust Video Editing

Add code
Jul 29, 2025
Figure 1 for Low-Cost Test-Time Adaptation for Robust Video Editing
Figure 2 for Low-Cost Test-Time Adaptation for Robust Video Editing
Figure 3 for Low-Cost Test-Time Adaptation for Robust Video Editing
Figure 4 for Low-Cost Test-Time Adaptation for Robust Video Editing
Viaarxiv icon

Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling

Add code
Jul 23, 2025
Figure 1 for Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling
Figure 2 for Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling
Figure 3 for Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling
Figure 4 for Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling
Viaarxiv icon

Resurrect Mask AutoRegressive Modeling for Efficient and Scalable Image Generation

Add code
Jul 17, 2025
Figure 1 for Resurrect Mask AutoRegressive Modeling for Efficient and Scalable Image Generation
Figure 2 for Resurrect Mask AutoRegressive Modeling for Efficient and Scalable Image Generation
Figure 3 for Resurrect Mask AutoRegressive Modeling for Efficient and Scalable Image Generation
Figure 4 for Resurrect Mask AutoRegressive Modeling for Efficient and Scalable Image Generation
Viaarxiv icon

Partitioner Guided Modal Learning Framework

Add code
Jul 15, 2025
Viaarxiv icon