Zachary Nado

A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMs

Oct 24, 2024
Via arXiv

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Mar 08, 2024
Via arXiv

Gemini: A Family of Highly Capable Multimodal Models

Dec 19, 2023
Via arXiv

Benchmarking Neural Network Training Algorithms

Jun 12, 2023
Via arXiv

PaLM 2 Technical Report

May 17, 2023
Via arXiv

Kernel Regression with Infinite-Width Neural Networks on Millions of Examples

Mar 09, 2023
Via arXiv

Benchmarking Bayesian Deep Learning on Diabetic Retinopathy Detection Tasks

Nov 23, 2022
Via arXiv

Adaptive Gradient Methods at the Edge of Stability

Jul 29, 2022
Via arXiv

Plex: Towards Reliability using Pretrained Large Model Extensions

Jul 15, 2022
Via arXiv

Pre-training helps Bayesian optimization too

Jul 07, 2022
Via arXiv