Picture for Orevaoghene Ahia

Orevaoghene Ahia

MAGNET: Improving the Multilingual Fairness of Language Models with Adaptive Gradient-Based Tokenization

Add code
Jul 11, 2024
Figure 1 for MAGNET: Improving the Multilingual Fairness of Language Models with Adaptive Gradient-Based Tokenization
Figure 2 for MAGNET: Improving the Multilingual Fairness of Language Models with Adaptive Gradient-Based Tokenization
Figure 3 for MAGNET: Improving the Multilingual Fairness of Language Models with Adaptive Gradient-Based Tokenization
Figure 4 for MAGNET: Improving the Multilingual Fairness of Language Models with Adaptive Gradient-Based Tokenization
Viaarxiv icon

Voices Unheard: NLP Resources and Models for Yorùbá Regional Dialects

Add code
Jun 27, 2024
Viaarxiv icon

Teaching LLMs to Abstain across Languages via Multilingual Feedback

Add code
Jun 22, 2024
Viaarxiv icon

Critical Learning Periods: Leveraging Early Training Dynamics for Efficient Data Pruning

Add code
May 29, 2024
Viaarxiv icon

DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages

Add code
Mar 16, 2024
Viaarxiv icon

MYTE: Morphology-Driven Byte Encoding for Better and Fairer Multilingual Language Modeling

Add code
Mar 15, 2024
Viaarxiv icon

Extracting Lexical Features from Dialects via Interpretable Dialect Classifiers

Add code
Feb 27, 2024
Viaarxiv icon

That was the last straw, we need more: Are Translation Systems Sensitive to Disambiguating Context?

Add code
Oct 23, 2023
Viaarxiv icon

Do All Languages Cost the Same? Tokenization in the Era of Commercial Language Models

Add code
May 23, 2023
Viaarxiv icon

AfriQA: Cross-lingual Open-Retrieval Question Answering for African Languages

Add code
May 11, 2023
Viaarxiv icon