Picture for Sedrick Keh

Sedrick Keh

DataComp-LM: In search of the next generation of training sets for language models

Add code
Jun 18, 2024
Figure 1 for DataComp-LM: In search of the next generation of training sets for language models
Figure 2 for DataComp-LM: In search of the next generation of training sets for language models
Figure 3 for DataComp-LM: In search of the next generation of training sets for language models
Figure 4 for DataComp-LM: In search of the next generation of training sets for language models
Viaarxiv icon

SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages

Add code
Jun 14, 2024
Figure 1 for SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages
Figure 2 for SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages
Figure 3 for SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages
Figure 4 for SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages
Viaarxiv icon

Linearizing Large Language Models

Add code
May 10, 2024
Viaarxiv icon

Language models scale reliably with over-training and on downstream tasks

Add code
Mar 13, 2024
Figure 1 for Language models scale reliably with over-training and on downstream tasks
Figure 2 for Language models scale reliably with over-training and on downstream tasks
Figure 3 for Language models scale reliably with over-training and on downstream tasks
Figure 4 for Language models scale reliably with over-training and on downstream tasks
Viaarxiv icon

A Critical Evaluation of AI Feedback for Aligning Large Language Models

Add code
Feb 19, 2024
Figure 1 for A Critical Evaluation of AI Feedback for Aligning Large Language Models
Figure 2 for A Critical Evaluation of AI Feedback for Aligning Large Language Models
Figure 3 for A Critical Evaluation of AI Feedback for Aligning Large Language Models
Figure 4 for A Critical Evaluation of AI Feedback for Aligning Large Language Models
Viaarxiv icon

Asking More Informative Questions for Grounded Retrieval

Add code
Nov 14, 2023
Viaarxiv icon