Picture for Edward Meeds

Edward Meeds

Towards Efficient Optimizer Design for LLM via Structured Fisher Approximation with a Low-Rank Extension

Add code
Feb 11, 2025
Viaarxiv icon

Gradient Multi-Normalization for Stateless and Scalable LLM Training

Add code
Feb 10, 2025
Viaarxiv icon

SWAN: SGD with Normalization and Whitening Enables Stateless LLM Training

Add code
Dec 23, 2024
Figure 1 for SWAN: SGD with Normalization and Whitening Enables Stateless LLM Training
Figure 2 for SWAN: SGD with Normalization and Whitening Enables Stateless LLM Training
Figure 3 for SWAN: SGD with Normalization and Whitening Enables Stateless LLM Training
Figure 4 for SWAN: SGD with Normalization and Whitening Enables Stateless LLM Training
Viaarxiv icon

SWAN: Preprocessing SGD Enables Adam-Level Performance On LLM Training With Significant Memory Reduction

Add code
Dec 17, 2024
Figure 1 for SWAN: Preprocessing SGD Enables Adam-Level Performance On LLM Training With Significant Memory Reduction
Figure 2 for SWAN: Preprocessing SGD Enables Adam-Level Performance On LLM Training With Significant Memory Reduction
Figure 3 for SWAN: Preprocessing SGD Enables Adam-Level Performance On LLM Training With Significant Memory Reduction
Figure 4 for SWAN: Preprocessing SGD Enables Adam-Level Performance On LLM Training With Significant Memory Reduction
Viaarxiv icon

AIRIVA: A Deep Generative Model of Adaptive Immune Repertoires

Add code
Apr 26, 2023
Viaarxiv icon

Capturing Actionable Dynamics with Structured Latent Ordinary Differential Equations

Add code
Feb 25, 2022
Figure 1 for Capturing Actionable Dynamics with Structured Latent Ordinary Differential Equations
Figure 2 for Capturing Actionable Dynamics with Structured Latent Ordinary Differential Equations
Figure 3 for Capturing Actionable Dynamics with Structured Latent Ordinary Differential Equations
Figure 4 for Capturing Actionable Dynamics with Structured Latent Ordinary Differential Equations
Viaarxiv icon

Efficient Amortised Bayesian Inference for Hierarchical and Nonlinear Dynamical Systems

Add code
May 28, 2019
Figure 1 for Efficient Amortised Bayesian Inference for Hierarchical and Nonlinear Dynamical Systems
Figure 2 for Efficient Amortised Bayesian Inference for Hierarchical and Nonlinear Dynamical Systems
Figure 3 for Efficient Amortised Bayesian Inference for Hierarchical and Nonlinear Dynamical Systems
Figure 4 for Efficient Amortised Bayesian Inference for Hierarchical and Nonlinear Dynamical Systems
Viaarxiv icon

Fixing Variational Bayes: Deterministic Variational Inference for Bayesian Neural Networks

Add code
Oct 09, 2018
Figure 1 for Fixing Variational Bayes: Deterministic Variational Inference for Bayesian Neural Networks
Figure 2 for Fixing Variational Bayes: Deterministic Variational Inference for Bayesian Neural Networks
Figure 3 for Fixing Variational Bayes: Deterministic Variational Inference for Bayesian Neural Networks
Figure 4 for Fixing Variational Bayes: Deterministic Variational Inference for Bayesian Neural Networks
Viaarxiv icon

Soft Weight-Sharing for Neural Network Compression

Add code
May 09, 2017
Figure 1 for Soft Weight-Sharing for Neural Network Compression
Figure 2 for Soft Weight-Sharing for Neural Network Compression
Figure 3 for Soft Weight-Sharing for Neural Network Compression
Figure 4 for Soft Weight-Sharing for Neural Network Compression
Viaarxiv icon

Automatic Variational ABC

Add code
Jun 28, 2016
Figure 1 for Automatic Variational ABC
Figure 2 for Automatic Variational ABC
Figure 3 for Automatic Variational ABC
Viaarxiv icon