Picture for Machel Reid

Machel Reid

Gemma 2: Improving Open Language Models at a Practical Size

Add code
Aug 02, 2024
Figure 1 for Gemma 2: Improving Open Language Models at a Practical Size
Figure 2 for Gemma 2: Improving Open Language Models at a Practical Size
Figure 3 for Gemma 2: Improving Open Language Models at a Practical Size
Figure 4 for Gemma 2: Improving Open Language Models at a Practical Size
Viaarxiv icon

Gemma: Open Models Based on Gemini Research and Technology

Add code
Mar 13, 2024
Figure 1 for Gemma: Open Models Based on Gemini Research and Technology
Figure 2 for Gemma: Open Models Based on Gemini Research and Technology
Figure 3 for Gemma: Open Models Based on Gemini Research and Technology
Figure 4 for Gemma: Open Models Based on Gemini Research and Technology
Viaarxiv icon

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Add code
Mar 08, 2024
Viaarxiv icon

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

BUFFET: Benchmarking Large Language Models for Few-shot Cross-lingual Transfer

Add code
May 24, 2023
Viaarxiv icon

mmT5: Modular Multilingual Pre-Training Solves Source Language Hallucinations

Add code
May 23, 2023
Figure 1 for mmT5: Modular Multilingual Pre-Training Solves Source Language Hallucinations
Figure 2 for mmT5: Modular Multilingual Pre-Training Solves Source Language Hallucinations
Figure 3 for mmT5: Modular Multilingual Pre-Training Solves Source Language Hallucinations
Figure 4 for mmT5: Modular Multilingual Pre-Training Solves Source Language Hallucinations
Viaarxiv icon

On the Role of Parallel Data in Cross-lingual Transfer Learning

Add code
Dec 20, 2022
Viaarxiv icon

DiffusER: Discrete Diffusion via Edit-based Reconstruction

Add code
Oct 30, 2022
Figure 1 for DiffusER: Discrete Diffusion via Edit-based Reconstruction
Figure 2 for DiffusER: Discrete Diffusion via Edit-based Reconstruction
Figure 3 for DiffusER: Discrete Diffusion via Edit-based Reconstruction
Figure 4 for DiffusER: Discrete Diffusion via Edit-based Reconstruction
Viaarxiv icon

M2D2: A Massively Multi-domain Language Modeling Dataset

Add code
Oct 13, 2022
Figure 1 for M2D2: A Massively Multi-domain Language Modeling Dataset
Figure 2 for M2D2: A Massively Multi-domain Language Modeling Dataset
Figure 3 for M2D2: A Massively Multi-domain Language Modeling Dataset
Figure 4 for M2D2: A Massively Multi-domain Language Modeling Dataset
Viaarxiv icon

Learning to Model Editing Processes

Add code
May 24, 2022
Figure 1 for Learning to Model Editing Processes
Figure 2 for Learning to Model Editing Processes
Figure 3 for Learning to Model Editing Processes
Figure 4 for Learning to Model Editing Processes
Viaarxiv icon