Baptiste Rozière

Pixtral 12B

Oct 09, 2024

Better & Faster Large Language Models via Multi-token Prediction

Apr 30, 2024

Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM

Mar 12, 2024

Getting the most out of your tokenizer for pre-training and domain adaptation

Feb 07, 2024

CRUXEval: A Benchmark for Code Reasoning, Understanding and Execution

Jan 05, 2024

Decoding Data Quality via Synthetic Corruptions: Embedding-guided Pruning of Code Data

Dec 05, 2023

Code Llama: Open Foundation Models for Code

Aug 25, 2023

LLaMA: Open and Efficient Foundation Language Models

Feb 27, 2023

Augmented Language Models: a Survey

Feb 15, 2023