Picture for Todor Mihaylov

Todor Mihaylov

Jack

The Llama 3 Herd of Models

Add code
Jul 31, 2024
Viaarxiv icon

Llama 2: Open Foundation and Fine-Tuned Chat Models

Add code
Jul 19, 2023
Figure 1 for Llama 2: Open Foundation and Fine-Tuned Chat Models
Figure 2 for Llama 2: Open Foundation and Fine-Tuned Chat Models
Figure 3 for Llama 2: Open Foundation and Fine-Tuned Chat Models
Figure 4 for Llama 2: Open Foundation and Fine-Tuned Chat Models
Viaarxiv icon

Understanding In-Context Learning via Supportive Pretraining Data

Add code
Jun 26, 2023
Viaarxiv icon

bgGLUE: A Bulgarian General Language Understanding Evaluation Benchmark

Add code
Jun 07, 2023
Viaarxiv icon

Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training

Add code
Jan 05, 2023
Figure 1 for Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training
Figure 2 for Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training
Figure 3 for Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training
Figure 4 for Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training
Viaarxiv icon

OPT-IML: Scaling Language Model Instruction Meta Learning through the Lens of Generalization

Add code
Dec 28, 2022
Viaarxiv icon

OPT: Open Pre-trained Transformer Language Models

Add code
May 05, 2022
Figure 1 for OPT: Open Pre-trained Transformer Language Models
Figure 2 for OPT: Open Pre-trained Transformer Language Models
Figure 3 for OPT: Open Pre-trained Transformer Language Models
Figure 4 for OPT: Open Pre-trained Transformer Language Models
Viaarxiv icon

Improving In-Context Few-Shot Learning via Self-Supervised Training

Add code
May 03, 2022
Figure 1 for Improving In-Context Few-Shot Learning via Self-Supervised Training
Figure 2 for Improving In-Context Few-Shot Learning via Self-Supervised Training
Figure 3 for Improving In-Context Few-Shot Learning via Self-Supervised Training
Figure 4 for Improving In-Context Few-Shot Learning via Self-Supervised Training
Viaarxiv icon

Efficient Large Scale Language Modeling with Mixtures of Experts

Add code
Dec 20, 2021
Figure 1 for Efficient Large Scale Language Modeling with Mixtures of Experts
Figure 2 for Efficient Large Scale Language Modeling with Mixtures of Experts
Figure 3 for Efficient Large Scale Language Modeling with Mixtures of Experts
Figure 4 for Efficient Large Scale Language Modeling with Mixtures of Experts
Viaarxiv icon

Few-shot Learning with Multilingual Language Models

Add code
Dec 20, 2021
Figure 1 for Few-shot Learning with Multilingual Language Models
Figure 2 for Few-shot Learning with Multilingual Language Models
Figure 3 for Few-shot Learning with Multilingual Language Models
Figure 4 for Few-shot Learning with Multilingual Language Models
Viaarxiv icon