Wu Lin

Training Data Attribution via Approximate Unrolled Differentiation

May 21, 2024
Via arXiv

Can We Remove the Square-Root in Adaptive Gradient Methods? A Second-Order Perspective

Feb 13, 2024
Via arXiv

Structured Inverse-Free Natural Gradient: Memory-Efficient & Numerically-Stable KFAC for Large Neural Nets

Dec 16, 2023
Via arXiv

Simplifying Momentum-based Riemannian Submanifold Optimization

Feb 20, 2023
Via arXiv

Structured second-order methods via natural gradient descent

Jul 22, 2021
Via arXiv

Tractable structured natural gradient descent using local parameterizations

Mar 04, 2021
Via arXiv

Handling the Positive-Definite Constraint in the Bayesian Learning Rule

Mar 08, 2020
Via arXiv

Stein's Lemma for the Reparameterization Trick with Exponential Family Mixtures

Oct 29, 2019
Via arXiv

Fast and Simple Natural-Gradient Variational Inference with Mixture of Exponential-family Approximations

Jun 07, 2019
Via arXiv

Fast and Scalable Bayesian Deep Learning by Weight-Perturbation in Adam

Aug 02, 2018
Via arXiv