Picture for Zaiwen Wen

Zaiwen Wen

Enhancing Zeroth-order Fine-tuning for Language Models with Low-rank Structures

Add code
Oct 10, 2024
Viaarxiv icon

ODE-based Learning to Optimize

Add code
Jun 04, 2024
Figure 1 for ODE-based Learning to Optimize
Figure 2 for ODE-based Learning to Optimize
Figure 3 for ODE-based Learning to Optimize
Figure 4 for ODE-based Learning to Optimize
Viaarxiv icon

An Improved Finite-time Analysis of Temporal Difference Learning with Deep Neural Networks

Add code
May 07, 2024
Viaarxiv icon

Monte Carlo Policy Gradient Method for Binary Optimization

Add code
Jul 03, 2023
Viaarxiv icon

Provable Convergence of Variational Monte Carlo Methods

Add code
Mar 19, 2023
Viaarxiv icon

Provably Efficient Gauss-Newton Temporal Difference Learning Method with Function Approximation

Add code
Feb 25, 2023
Viaarxiv icon

Riemannian Natural Gradient Methods

Add code
Jul 15, 2022
Figure 1 for Riemannian Natural Gradient Methods
Figure 2 for Riemannian Natural Gradient Methods
Figure 3 for Riemannian Natural Gradient Methods
Viaarxiv icon

A Near-Optimal Primal-Dual Method for Off-Policy Learning in CMDP

Add code
Jul 13, 2022
Figure 1 for A Near-Optimal Primal-Dual Method for Off-Policy Learning in CMDP
Viaarxiv icon

NG+ : A Multi-Step Matrix-Product Natural Gradient Method for Deep Learning

Add code
Jun 14, 2021
Figure 1 for NG+ : A Multi-Step Matrix-Product Natural Gradient Method for Deep Learning
Figure 2 for NG+ : A Multi-Step Matrix-Product Natural Gradient Method for Deep Learning
Figure 3 for NG+ : A Multi-Step Matrix-Product Natural Gradient Method for Deep Learning
Figure 4 for NG+ : A Multi-Step Matrix-Product Natural Gradient Method for Deep Learning
Viaarxiv icon

A Stochastic Composite Augmented Lagrangian Method For Reinforcement Learning

Add code
May 20, 2021
Figure 1 for A Stochastic Composite Augmented Lagrangian Method For Reinforcement Learning
Figure 2 for A Stochastic Composite Augmented Lagrangian Method For Reinforcement Learning
Figure 3 for A Stochastic Composite Augmented Lagrangian Method For Reinforcement Learning
Figure 4 for A Stochastic Composite Augmented Lagrangian Method For Reinforcement Learning
Viaarxiv icon