Picture for Runa Eschenhagen

Runa Eschenhagen

Accelerating Neural Network Training: An Analysis of the AlgoPerf Competition

Add code
Feb 20, 2025
Viaarxiv icon

Spectral-factorized Positive-definite Curvature Learning for NN Training

Add code
Feb 10, 2025
Viaarxiv icon

Position: Curvature Matrices Should Be Democratized via Linear Operators

Add code
Jan 31, 2025
Figure 1 for Position: Curvature Matrices Should Be Democratized via Linear Operators
Figure 2 for Position: Curvature Matrices Should Be Democratized via Linear Operators
Figure 3 for Position: Curvature Matrices Should Be Democratized via Linear Operators
Figure 4 for Position: Curvature Matrices Should Be Democratized via Linear Operators
Viaarxiv icon

Influence Functions for Scalable Data Attribution in Diffusion Models

Add code
Oct 17, 2024
Viaarxiv icon

Can We Remove the Square-Root in Adaptive Gradient Methods? A Second-Order Perspective

Add code
Feb 13, 2024
Figure 1 for Can We Remove the Square-Root in Adaptive Gradient Methods? A Second-Order Perspective
Figure 2 for Can We Remove the Square-Root in Adaptive Gradient Methods? A Second-Order Perspective
Figure 3 for Can We Remove the Square-Root in Adaptive Gradient Methods? A Second-Order Perspective
Figure 4 for Can We Remove the Square-Root in Adaptive Gradient Methods? A Second-Order Perspective
Viaarxiv icon

Structured Inverse-Free Natural Gradient: Memory-Efficient & Numerically-Stable KFAC for Large Neural Nets

Add code
Dec 16, 2023
Viaarxiv icon

Kronecker-Factored Approximate Curvature for Modern Neural Network Architectures

Add code
Nov 01, 2023
Figure 1 for Kronecker-Factored Approximate Curvature for Modern Neural Network Architectures
Figure 2 for Kronecker-Factored Approximate Curvature for Modern Neural Network Architectures
Figure 3 for Kronecker-Factored Approximate Curvature for Modern Neural Network Architectures
Figure 4 for Kronecker-Factored Approximate Curvature for Modern Neural Network Architectures
Viaarxiv icon

Benchmarking Neural Network Training Algorithms

Add code
Jun 12, 2023
Figure 1 for Benchmarking Neural Network Training Algorithms
Figure 2 for Benchmarking Neural Network Training Algorithms
Figure 3 for Benchmarking Neural Network Training Algorithms
Figure 4 for Benchmarking Neural Network Training Algorithms
Viaarxiv icon

Promises and Pitfalls of the Linearized Laplace in Bayesian Optimization

Add code
Apr 17, 2023
Figure 1 for Promises and Pitfalls of the Linearized Laplace in Bayesian Optimization
Figure 2 for Promises and Pitfalls of the Linearized Laplace in Bayesian Optimization
Figure 3 for Promises and Pitfalls of the Linearized Laplace in Bayesian Optimization
Figure 4 for Promises and Pitfalls of the Linearized Laplace in Bayesian Optimization
Viaarxiv icon

Approximate Bayesian Neural Operators: Uncertainty Quantification for Parametric PDEs

Add code
Aug 02, 2022
Figure 1 for Approximate Bayesian Neural Operators: Uncertainty Quantification for Parametric PDEs
Figure 2 for Approximate Bayesian Neural Operators: Uncertainty Quantification for Parametric PDEs
Figure 3 for Approximate Bayesian Neural Operators: Uncertainty Quantification for Parametric PDEs
Figure 4 for Approximate Bayesian Neural Operators: Uncertainty Quantification for Parametric PDEs
Viaarxiv icon