David Emerson

The Impact of Unstated Norms in Bias Analysis of Language Models

Apr 07, 2024
Via arXiv

FlexModel: A Framework for Interpretability of Distributed Large Language Models

Dec 05, 2023
Via arXiv

Interpretable Stereotype Identification through Reasoning

Jul 24, 2023
Via arXiv

Can Instruction Fine-Tuned Language Models Identify Social Bias through Prompting?

Jul 19, 2023
Via arXiv

Soft-prompt Tuning for Large Language Models to Evaluate Bias

Jun 07, 2023
Via arXiv