Picture for Sara Hooker

Sara Hooker

M-RewardBench: Evaluating Reward Models in Multilingual Settings

Add code
Oct 20, 2024
Viaarxiv icon

Mix Data or Merge Models? Optimizing for Diverse Multi-Task Learning

Add code
Oct 14, 2024
Viaarxiv icon

Nexus: Specialization meets Adaptability for Efficiently Training Mixture of Experts

Add code
Aug 28, 2024
Viaarxiv icon

Multilingual Arbitrage: Optimizing Data Pools to Accelerate Multilingual Progress

Add code
Aug 27, 2024
Viaarxiv icon

To Code, or Not To Code? Exploring Impact of Code in Pre-training

Add code
Aug 20, 2024
Figure 1 for To Code, or Not To Code? Exploring Impact of Code in Pre-training
Figure 2 for To Code, or Not To Code? Exploring Impact of Code in Pre-training
Figure 3 for To Code, or Not To Code? Exploring Impact of Code in Pre-training
Figure 4 for To Code, or Not To Code? Exploring Impact of Code in Pre-training
Viaarxiv icon

Consent in Crisis: The Rapid Decline of the AI Data Commons

Add code
Jul 24, 2024
Viaarxiv icon

On the Limitations of Compute Thresholds as a Governance Strategy

Add code
Jul 08, 2024
Viaarxiv icon

How Does Quantization Affect Multilingual LLMs?

Add code
Jul 03, 2024
Viaarxiv icon

RLHF Can Speak Many Languages: Unlocking Multilingual Preference Optimization for LLMs

Add code
Jul 02, 2024
Viaarxiv icon

LLM See, LLM Do: Guiding Data Generation to Target Non-Differentiable Objectives

Add code
Jul 01, 2024
Viaarxiv icon