Wenpin Tang

Fine-Tuning Diffusion Generative Models via Rich Preference Optimization

Mar 13, 2025

Score as Action: Fine-Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning

Feb 03, 2025

Regret of exploratory policy improvement and $q$-learning

Nov 02, 2024

RainbowPO: A Unified Framework for Combining Improvements in Preference Optimization

Oct 05, 2024

Preference Tuning with Human Feedback on Language, Speech, and Vision Tasks: A Survey

Sep 17, 2024

Scores as Actions: a framework of fine-tuning diffusion models by continuous-time reinforcement learning

Sep 12, 2024

Mallows-DPO: Fine-Tune Your LLM with Preference Dispersions

May 23, 2024

Fine-tuning of diffusion models via stochastic control: entropy regularization and beyond

Mar 12, 2024

Score-based Diffusion Models via Stochastic Differential Equations -- a Technical Tutorial

Feb 12, 2024

Contractive Diffusion Probabilistic Models

Jan 23, 2024