Adam Roberts

Training LLMs over Neurally Compressed Text

Apr 04, 2024
Via arXiv

Gemma: Open Models Based on Gemini Research and Technology

Mar 13, 2024
Via arXiv

A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity

May 22, 2023
Via arXiv

UniMax: Fairer and more Effective Language Sampling for Large-Scale Multilingual Pretraining

Apr 18, 2023
Via arXiv

The Flan Collection: Designing Data and Methods for Effective Instruction Tuning

Feb 14, 2023
Via arXiv

SingSong: Generating musical accompaniments from singing

Jan 30, 2023
Via arXiv

MusicLM: Generating Music From Text

Jan 26, 2023
Via arXiv

Character-Aware Models Improve Visual Text Rendering

Dec 20, 2022
Via arXiv

VeLO: Training Versatile Learned Optimizers by Scaling Up

Nov 17, 2022
Via arXiv

Large Language Models Struggle to Learn Long-Tail Knowledge

Nov 15, 2022
Via arXiv