Picture for Khalid Almubarak

Khalid Almubarak

ArabicMMLU: Assessing Massive Multitask Language Understanding in Arabic

Add code
Feb 20, 2024
Viaarxiv icon

CIDAR: Culturally Relevant Instruction Dataset For Arabic

Add code
Feb 05, 2024
Viaarxiv icon

The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset

Add code
Mar 07, 2023
Figure 1 for The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
Figure 2 for The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
Figure 3 for The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
Figure 4 for The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
Viaarxiv icon

BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting

Add code
Dec 19, 2022
Figure 1 for BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting
Figure 2 for BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting
Figure 3 for BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting
Figure 4 for BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting
Viaarxiv icon

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Add code
Nov 09, 2022
Viaarxiv icon

Crosslingual Generalization through Multitask Finetuning

Add code
Nov 03, 2022
Viaarxiv icon

PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts

Add code
Feb 02, 2022
Figure 1 for PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts
Figure 2 for PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts
Figure 3 for PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts
Figure 4 for PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts
Viaarxiv icon