Picture for Yulai Zhao

Yulai Zhao

Derivative-Free Guidance in Continuous and Discrete Diffusion Models with Soft Value-Based Decoding

Add code
Aug 15, 2024
Figure 1 for Derivative-Free Guidance in Continuous and Discrete Diffusion Models with Soft Value-Based Decoding
Figure 2 for Derivative-Free Guidance in Continuous and Discrete Diffusion Models with Soft Value-Based Decoding
Figure 3 for Derivative-Free Guidance in Continuous and Discrete Diffusion Models with Soft Value-Based Decoding
Figure 4 for Derivative-Free Guidance in Continuous and Discrete Diffusion Models with Soft Value-Based Decoding
Viaarxiv icon

Understanding Reinforcement Learning-Based Fine-Tuning of Diffusion Models: A Tutorial and Review

Add code
Jul 18, 2024
Viaarxiv icon

Adding Conditional Control to Diffusion Models with Reinforcement Learning

Add code
Jun 17, 2024
Figure 1 for Adding Conditional Control to Diffusion Models with Reinforcement Learning
Figure 2 for Adding Conditional Control to Diffusion Models with Reinforcement Learning
Figure 3 for Adding Conditional Control to Diffusion Models with Reinforcement Learning
Figure 4 for Adding Conditional Control to Diffusion Models with Reinforcement Learning
Viaarxiv icon

Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models

Add code
May 31, 2024
Viaarxiv icon

Fine-Tuning of Continuous-Time Diffusion Models as Entropy-Regularized Control

Add code
Feb 28, 2024
Viaarxiv icon

Feedback Efficient Online Fine-Tuning of Diffusion Models

Add code
Feb 27, 2024
Viaarxiv icon

Provably Efficient CVaR RL in Low-rank MDPs

Add code
Nov 20, 2023
Viaarxiv icon

Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning

Add code
May 08, 2023
Viaarxiv icon

Blessing of Class Diversity in Pre-training

Add code
Sep 12, 2022
Figure 1 for Blessing of Class Diversity in Pre-training
Figure 2 for Blessing of Class Diversity in Pre-training
Figure 3 for Blessing of Class Diversity in Pre-training
Figure 4 for Blessing of Class Diversity in Pre-training
Viaarxiv icon

Optimizing the Performative Risk under Weak Convexity Assumptions

Add code
Sep 12, 2022
Viaarxiv icon