Huizhuo Yuan

MARS: Unleashing the Power of Variance Reduction for Training Large Models

Nov 15, 2024
Via arXiv

Accelerated Preference Optimization for Large Language Model Alignment

Oct 08, 2024
Via arXiv

Self-Play Preference Optimization for Language Model Alignment

May 01, 2024
Via arXiv

Protein Conformation Generation via Force-Guided SE(3) Diffusion Models

Mar 21, 2024
Via arXiv

Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation

Feb 15, 2024
Via arXiv

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Jan 02, 2024
Via arXiv

Fast Sampling via De-randomization for Discrete Diffusion Models

Dec 14, 2023
Via arXiv

Stochastic Recursive Momentum for Policy Gradient Methods

Mar 09, 2020
Via arXiv

Stochastic Modified Equations for Continuous Limit of Stochastic ADMM

Mar 07, 2020
Via arXiv

Stochastic Recursive Variance Reduction for Efficient Smooth Non-Convex Compositional Optimization

Jan 25, 2020
Via arXiv