Picture for Naomi Saphra

Naomi Saphra

Mechanistic?

Add code
Oct 07, 2024
Viaarxiv icon

Fast Forwarding Low-Rank Training

Add code
Sep 06, 2024
Viaarxiv icon

Benchmarks as Microscopes: A Call for Model Metrology

Add code
Jul 22, 2024
Viaarxiv icon

ChatGPT Doesn't Trust Chargers Fans: Guardrail Sensitivity in Context

Add code
Jul 10, 2024
Viaarxiv icon

Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon

Add code
Jun 25, 2024
Viaarxiv icon

Transcendence: Generative Models Can Outperform The Experts That Train Them

Add code
Jun 17, 2024
Viaarxiv icon

Knowing Your Nonlinearities: Shapley Interactions Reveal the Underlying Structure of Data

Add code
Mar 19, 2024
Viaarxiv icon

Towards out-of-distribution generalization in large-scale astronomical surveys: robust networks learn similar representations

Add code
Nov 29, 2023
Viaarxiv icon

Attribute Diversity Determines the Systematicity Gap in VQA

Add code
Nov 15, 2023
Viaarxiv icon

First Tragedy, then Parse: History Repeats Itself in the New Era of Large Language Models

Add code
Nov 08, 2023
Viaarxiv icon