Zaid Alyafeai

Arabic Stable LM: Adapting Stable LM 2 1.6B to Arabic

Dec 05, 2024
Via arXiv

Rephrasing natural text data with different languages and quality levels for Large Language Model pre-training

Oct 28, 2024
Via arXiv

ArabicMMLU: Assessing Massive Multitask Language Understanding in Arabic

Feb 20, 2024
Via arXiv

Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning

Feb 09, 2024
Via arXiv

CIDAR: Culturally Relevant Instruction Dataset For Arabic

Feb 05, 2024
Via arXiv

Ashaar: Automatic Analysis and Generation of Arabic Poetry Using Deep Learning Approaches

Jul 12, 2023
Via arXiv

Taqyim: Evaluating Arabic NLP Tasks Using ChatGPT Models

Jun 28, 2023
Via arXiv

The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset

Mar 07, 2023
Via arXiv

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Nov 09, 2022
Via arXiv

Crosslingual Generalization through Multitask Finetuning

Nov 03, 2022
Via arXiv