Huizhuo Yuan

MARS: Unleashing the Power of Variance Reduction for Training Large Models

Nov 15, 2024
Via arXiv

Accelerated Preference Optimization for Large Language Model Alignment

Oct 08, 2024
Via arXiv

Self-Play Preference Optimization for Language Model Alignment

May 01, 2024
Via arXiv

Protein Conformation Generation via Force-Guided SE(3) Diffusion Models

Mar 21, 2024
Via arXiv

Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation

Feb 15, 2024
Via arXiv

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Jan 02, 2024
Via arXiv

Fast Sampling via De-randomization for Discrete Diffusion Models

Dec 14, 2023
Via arXiv

Stochastic Recursive Momentum for Policy Gradient Methods

Mar 09, 2020
Via arXiv

Stochastic Modified Equations for Continuous Limit of Stochastic ADMM

Mar 07, 2020
Via arXiv

Stochastic Recursive Variance Reduction for Efficient Smooth Non-Convex Compositional Optimization

Jan 25, 2020
Via arXiv