Beyza Ermis

Aya Expanse: Combining Research Breakthroughs for a New Multilingual Frontier

Dec 05, 2024
Via arXiv

Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation

Dec 04, 2024
Via arXiv

Mix Data or Merge Models? Optimizing for Diverse Multi-Task Learning

Oct 14, 2024
Via arXiv

Adaptation Odyssey in LLMs: Why Does Additional Pretraining Sometimes Fail to Improve?

Oct 08, 2024
Via arXiv

Multilingual Arbitrage: Optimizing Data Pools to Accelerate Multilingual Progress

Aug 27, 2024
Via arXiv

The Multilingual Alignment Prism: Aligning Global and Local Preferences to Reduce Harm

Jun 26, 2024
Via arXiv

From One to Many: Expanding the Scope of Toxicity Mitigation in Language Models

Mar 06, 2024
Via arXiv

Investigating Continual Pretraining in Large Language Models: Insights and Implications

Feb 27, 2024
Via arXiv

Elo Uncovered: Robustness and Best Practices in Language Model Evaluation

Nov 29, 2023
Via arXiv

Which Prompts Make The Difference? Data Prioritization For Efficient Human LLM Evaluation

Oct 22, 2023
Via arXiv