Karen Simonyan

Flamingo: a Visual Language Model for Few-Shot Learning

Apr 29, 2022
Via arXiv

Training Compute-Optimal Large Language Models

Mar 29, 2022
Via arXiv

Hierarchical Perceiver

Feb 22, 2022
Via arXiv

Unified Scaling Laws for Routed Language Models

Feb 09, 2022
Via arXiv

Improving language models by retrieving from trillions of tokens

Jan 11, 2022
Via arXiv

Scaling Language Models: Methods, Analysis & Insights from Training Gopher

Dec 08, 2021
Via arXiv

Machine Translation Decoding beyond Beam Search

Apr 12, 2021
Via arXiv

Skillful Precipitation Nowcasting using Deep Generative Models of Radar

Apr 02, 2021
Via arXiv

Variable-rate discrete representation learning

Mar 10, 2021
Via arXiv

High-Performance Large-Scale Image Recognition Without Normalization

Feb 11, 2021
Via arXiv