Barret Zoph

Tony

GPT-4o System Card

Oct 25, 2024
Via arXiv

Flan-MoE: Scaling Instruction-Finetuned Language Models with Sparse Mixture of Experts

May 24, 2023
Via arXiv

A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity

May 22, 2023
Via arXiv

The Flan Collection: Designing Data and Methods for Effective Instruction Tuning

Feb 14, 2023
Via arXiv

Scaling Instruction-Finetuned Language Models

Oct 20, 2022
Via arXiv

A Review of Sparse Expert Models in Deep Learning

Sep 04, 2022
Via arXiv

Emergent Abilities of Large Language Models

Jun 15, 2022
Via arXiv

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Jun 10, 2022
Via arXiv

PaLM: Scaling Language Modeling with Pathways

Apr 19, 2022
Via arXiv

Designing Effective Sparse Expert Models

Feb 17, 2022
Via arXiv