Picture for Beyza Ermis

Beyza Ermis

Mix Data or Merge Models? Optimizing for Diverse Multi-Task Learning

Add code
Oct 14, 2024
Viaarxiv icon

Adaptation Odyssey in LLMs: Why Does Additional Pretraining Sometimes Fail to Improve?

Add code
Oct 08, 2024
Figure 1 for Adaptation Odyssey in LLMs: Why Does Additional Pretraining Sometimes Fail to Improve?
Figure 2 for Adaptation Odyssey in LLMs: Why Does Additional Pretraining Sometimes Fail to Improve?
Figure 3 for Adaptation Odyssey in LLMs: Why Does Additional Pretraining Sometimes Fail to Improve?
Figure 4 for Adaptation Odyssey in LLMs: Why Does Additional Pretraining Sometimes Fail to Improve?
Viaarxiv icon

Multilingual Arbitrage: Optimizing Data Pools to Accelerate Multilingual Progress

Add code
Aug 27, 2024
Viaarxiv icon

The Multilingual Alignment Prism: Aligning Global and Local Preferences to Reduce Harm

Add code
Jun 26, 2024
Viaarxiv icon

From One to Many: Expanding the Scope of Toxicity Mitigation in Language Models

Add code
Mar 06, 2024
Viaarxiv icon

Investigating Continual Pretraining in Large Language Models: Insights and Implications

Add code
Feb 27, 2024
Viaarxiv icon

Elo Uncovered: Robustness and Best Practices in Language Model Evaluation

Add code
Nov 29, 2023
Figure 1 for Elo Uncovered: Robustness and Best Practices in Language Model Evaluation
Figure 2 for Elo Uncovered: Robustness and Best Practices in Language Model Evaluation
Figure 3 for Elo Uncovered: Robustness and Best Practices in Language Model Evaluation
Figure 4 for Elo Uncovered: Robustness and Best Practices in Language Model Evaluation
Viaarxiv icon

Which Prompts Make The Difference? Data Prioritization For Efficient Human LLM Evaluation

Add code
Oct 22, 2023
Viaarxiv icon

Goodtriever: Adaptive Toxicity Mitigation with Retrieval-augmented Models

Add code
Oct 11, 2023
Viaarxiv icon

On the Challenges of Using Black-Box APIs for Toxicity Evaluation in Research

Add code
Apr 24, 2023
Viaarxiv icon