Picture for Qingyu Shi

Qingyu Shi

Prism: Efficient Test-Time Scaling via Hierarchical Search and Self-Verification for Discrete Diffusion Language Models

Add code
Feb 02, 2026
Viaarxiv icon

RecTok: Reconstruction Distillation along Rectified Flow

Add code
Dec 17, 2025
Viaarxiv icon

Muddit: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model

Add code
May 29, 2025
Viaarxiv icon

On Path to Multimodal Generalist: General-Level and General-Bench

Add code
May 07, 2025
Viaarxiv icon

An Empirical Study of GPT-4o Image Generation Capabilities

Add code
Apr 08, 2025
Figure 1 for An Empirical Study of GPT-4o Image Generation Capabilities
Figure 2 for An Empirical Study of GPT-4o Image Generation Capabilities
Figure 3 for An Empirical Study of GPT-4o Image Generation Capabilities
Figure 4 for An Empirical Study of GPT-4o Image Generation Capabilities
Viaarxiv icon

Decouple and Track: Benchmarking and Improving Video Diffusion Transformers for Motion Transfer

Add code
Mar 21, 2025
Viaarxiv icon

RelationBooth: Towards Relation-Aware Customized Object Generation

Add code
Oct 30, 2024
Figure 1 for RelationBooth: Towards Relation-Aware Customized Object Generation
Figure 2 for RelationBooth: Towards Relation-Aware Customized Object Generation
Figure 3 for RelationBooth: Towards Relation-Aware Customized Object Generation
Figure 4 for RelationBooth: Towards Relation-Aware Customized Object Generation
Viaarxiv icon

RAP-SAM: Towards Real-Time All-Purpose Segment Anything

Add code
Jan 18, 2024
Viaarxiv icon