Xiaozhe Yao

RedPajama: an Open Dataset for Training Large Language Models

Nov 19, 2024

Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order

Mar 30, 2024

DeltaZip: Multi-Tenant Language Model Serving via Delta Compression

Dec 08, 2023

DMLR: Data-centric Machine Learning Research -- Past, Present and Future

Nov 21, 2023

DataPerf: Benchmarks for Data-Centric AI Development

Jul 20, 2022

SHiFT: An Efficient, Flexible Search Engine for Transfer Learning

Apr 04, 2022