Picture for Nikhil Anand

Nikhil Anand

Loss-to-Loss Prediction: Scaling Laws for All Datasets

Add code
Nov 19, 2024
Figure 1 for Loss-to-Loss Prediction: Scaling Laws for All Datasets
Figure 2 for Loss-to-Loss Prediction: Scaling Laws for All Datasets
Figure 3 for Loss-to-Loss Prediction: Scaling Laws for All Datasets
Figure 4 for Loss-to-Loss Prediction: Scaling Laws for All Datasets
Viaarxiv icon

Mixture of Parrots: Experts improve memorization more than reasoning

Add code
Oct 24, 2024
Figure 1 for Mixture of Parrots: Experts improve memorization more than reasoning
Figure 2 for Mixture of Parrots: Experts improve memorization more than reasoning
Figure 3 for Mixture of Parrots: Experts improve memorization more than reasoning
Figure 4 for Mixture of Parrots: Experts improve memorization more than reasoning
Viaarxiv icon

Dataset Difficulty and the Role of Inductive Bias

Add code
Jan 03, 2024
Viaarxiv icon

Influence Scores at Scale for Efficient Language Data Sampling

Add code
Nov 27, 2023
Viaarxiv icon

Comprehensive Benchmarking of Entropy and Margin Based Scoring Metrics for Data Selection

Add code
Nov 27, 2023
Viaarxiv icon