Michael Hanna

LLM Circuit Analyses Are Consistent Across Training and Scale

Jul 15, 2024
Via arXiv

LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks

Jun 26, 2024
Via arXiv

Have Faith in Faithfulness: Going Beyond Circuit Overlap When Finding Model Mechanisms

Mar 26, 2024
Via arXiv

Do Pre-Trained Language Models Detect and Understand Semantic Underspecification? Ask the DUST!

Feb 19, 2024
Via arXiv

When Language Models Fall in Love: Animacy Processing in Transformer Language Models

Oct 23, 2023
Via arXiv

Identifying and Adapting Transformer-Components Responsible for Gender Bias in an English Language Model

Oct 19, 2023
Via arXiv

ChapGTP, ILLC's Attempt at Raising a BabyLM: Improving Data Efficiency by Automatic Task Formation

Oct 17, 2023
Via arXiv

How does GPT-2 compute greater-than? Interpreting mathematical abilities in a pre-trained language model

Apr 30, 2023
Via arXiv