Naomi Saphra

PolyPythias: Stability and Outliers across Fifty Language Model Pre-Training Runs

Mar 12, 2025

Distributional Scaling Laws for Emergent Capabilities

Feb 24, 2025

Sometimes I am a Tree: Data Drives Unstable Hierarchical Generalization

Dec 05, 2024

Mechanistic?

Oct 07, 2024

Fast Forwarding Low-Rank Training

Sep 06, 2024

Benchmarks as Microscopes: A Call for Model Metrology

Jul 22, 2024

ChatGPT Doesn't Trust Chargers Fans: Guardrail Sensitivity in Context

Jul 10, 2024

Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon

Jun 25, 2024

Transcendence: Generative Models Can Outperform The Experts That Train Them

Jun 17, 2024

Knowing Your Nonlinearities: Shapley Interactions Reveal the Underlying Structure of Data

Mar 19, 2024