Picture for Yuanzhi Zhu

Yuanzhi Zhu

Diffusion Reinforcement Learning via Centered Reward Distillation

Add code
Mar 14, 2026
Viaarxiv icon

Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation

Add code
Mar 20, 2025
Viaarxiv icon

Di$\mathtt{[M]}$O: Distilling Masked Diffusion Models into One-step Generator

Add code
Mar 19, 2025
Figure 1 for Di$\mathtt{[M]}$O: Distilling Masked Diffusion Models into One-step Generator
Figure 2 for Di$\mathtt{[M]}$O: Distilling Masked Diffusion Models into One-step Generator
Figure 3 for Di$\mathtt{[M]}$O: Distilling Masked Diffusion Models into One-step Generator
Figure 4 for Di$\mathtt{[M]}$O: Distilling Masked Diffusion Models into One-step Generator
Viaarxiv icon

Qwen2.5-VL Technical Report

Add code
Feb 19, 2025
Figure 1 for Qwen2.5-VL Technical Report
Figure 2 for Qwen2.5-VL Technical Report
Figure 3 for Qwen2.5-VL Technical Report
Figure 4 for Qwen2.5-VL Technical Report
Viaarxiv icon

SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild

Add code
Jan 07, 2025
Figure 1 for SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild
Figure 2 for SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild
Figure 3 for SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild
Figure 4 for SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild
Viaarxiv icon

OFTSR: One-Step Flow for Image Super-Resolution with Tunable Fidelity-Realism Trade-offs

Add code
Dec 12, 2024
Figure 1 for OFTSR: One-Step Flow for Image Super-Resolution with Tunable Fidelity-Realism Trade-offs
Figure 2 for OFTSR: One-Step Flow for Image Super-Resolution with Tunable Fidelity-Realism Trade-offs
Figure 3 for OFTSR: One-Step Flow for Image Super-Resolution with Tunable Fidelity-Realism Trade-offs
Figure 4 for OFTSR: One-Step Flow for Image Super-Resolution with Tunable Fidelity-Realism Trade-offs
Viaarxiv icon

Accelerating Video Diffusion Models via Distribution Matching

Add code
Dec 08, 2024
Figure 1 for Accelerating Video Diffusion Models via Distribution Matching
Figure 2 for Accelerating Video Diffusion Models via Distribution Matching
Figure 3 for Accelerating Video Diffusion Models via Distribution Matching
Figure 4 for Accelerating Video Diffusion Models via Distribution Matching
Viaarxiv icon

Generalizable Single-Source Cross-modality Medical Image Segmentation via Invariant Causal Mechanisms

Add code
Nov 07, 2024
Viaarxiv icon

Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances

Add code
Oct 24, 2024
Figure 1 for Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances
Figure 2 for Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances
Figure 3 for Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances
Figure 4 for Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances
Viaarxiv icon

Visual Text Generation in the Wild

Add code
Jul 19, 2024
Figure 1 for Visual Text Generation in the Wild
Figure 2 for Visual Text Generation in the Wild
Figure 3 for Visual Text Generation in the Wild
Figure 4 for Visual Text Generation in the Wild
Viaarxiv icon