Picture for Yonatan Oren

Yonatan Oren

RedPajama: an Open Dataset for Training Large Language Models

Add code
Nov 19, 2024
Viaarxiv icon

Proving Test Set Contamination in Black Box Language Models

Add code
Oct 26, 2023
Viaarxiv icon

Distributionally Robust Language Modeling

Add code
Sep 04, 2019
Figure 1 for Distributionally Robust Language Modeling
Figure 2 for Distributionally Robust Language Modeling
Figure 3 for Distributionally Robust Language Modeling
Figure 4 for Distributionally Robust Language Modeling
Viaarxiv icon

A Retrieve-and-Edit Framework for Predicting Structured Outputs

Add code
Dec 04, 2018
Figure 1 for A Retrieve-and-Edit Framework for Predicting Structured Outputs
Figure 2 for A Retrieve-and-Edit Framework for Predicting Structured Outputs
Figure 3 for A Retrieve-and-Edit Framework for Predicting Structured Outputs
Figure 4 for A Retrieve-and-Edit Framework for Predicting Structured Outputs
Viaarxiv icon

Generating Sentences by Editing Prototypes

Add code
Sep 07, 2018
Viaarxiv icon