Max Marion

Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models

May 30, 2024
Via arXiv

When Less is More: Investigating Data Pruning for Pretraining LLMs at Scale

Sep 08, 2023
Via arXiv