Picture for Sara Hooker

Sara Hooker

Bridging the Data Provenance Gap Across Text, Speech and Video

Add code
Dec 19, 2024
Figure 1 for Bridging the Data Provenance Gap Across Text, Speech and Video
Figure 2 for Bridging the Data Provenance Gap Across Text, Speech and Video
Figure 3 for Bridging the Data Provenance Gap Across Text, Speech and Video
Figure 4 for Bridging the Data Provenance Gap Across Text, Speech and Video
Viaarxiv icon

Aya Expanse: Combining Research Breakthroughs for a New Multilingual Frontier

Add code
Dec 05, 2024
Viaarxiv icon

Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation

Add code
Dec 04, 2024
Viaarxiv icon

The Reality of AI and Biorisk

Add code
Dec 02, 2024
Viaarxiv icon

INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge

Add code
Nov 29, 2024
Figure 1 for INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge
Figure 2 for INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge
Figure 3 for INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge
Figure 4 for INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge
Viaarxiv icon

M-RewardBench: Evaluating Reward Models in Multilingual Settings

Add code
Oct 20, 2024
Figure 1 for M-RewardBench: Evaluating Reward Models in Multilingual Settings
Figure 2 for M-RewardBench: Evaluating Reward Models in Multilingual Settings
Figure 3 for M-RewardBench: Evaluating Reward Models in Multilingual Settings
Figure 4 for M-RewardBench: Evaluating Reward Models in Multilingual Settings
Viaarxiv icon

Mix Data or Merge Models? Optimizing for Diverse Multi-Task Learning

Add code
Oct 14, 2024
Viaarxiv icon

Nexus: Specialization meets Adaptability for Efficiently Training Mixture of Experts

Add code
Aug 28, 2024
Viaarxiv icon

Multilingual Arbitrage: Optimizing Data Pools to Accelerate Multilingual Progress

Add code
Aug 27, 2024
Figure 1 for Multilingual Arbitrage: Optimizing Data Pools to Accelerate Multilingual Progress
Figure 2 for Multilingual Arbitrage: Optimizing Data Pools to Accelerate Multilingual Progress
Figure 3 for Multilingual Arbitrage: Optimizing Data Pools to Accelerate Multilingual Progress
Figure 4 for Multilingual Arbitrage: Optimizing Data Pools to Accelerate Multilingual Progress
Viaarxiv icon

To Code, or Not To Code? Exploring Impact of Code in Pre-training

Add code
Aug 20, 2024
Figure 1 for To Code, or Not To Code? Exploring Impact of Code in Pre-training
Figure 2 for To Code, or Not To Code? Exploring Impact of Code in Pre-training
Figure 3 for To Code, or Not To Code? Exploring Impact of Code in Pre-training
Figure 4 for To Code, or Not To Code? Exploring Impact of Code in Pre-training
Viaarxiv icon