Picture for Eric Michael Smith

Eric Michael Smith

Jack

The Root Shapes the Fruit: On the Persistence of Gender-Exclusive Harms in Aligned Language Models

Add code
Nov 06, 2024
Figure 1 for The Root Shapes the Fruit: On the Persistence of Gender-Exclusive Harms in Aligned Language Models
Figure 2 for The Root Shapes the Fruit: On the Persistence of Gender-Exclusive Harms in Aligned Language Models
Figure 3 for The Root Shapes the Fruit: On the Persistence of Gender-Exclusive Harms in Aligned Language Models
Figure 4 for The Root Shapes the Fruit: On the Persistence of Gender-Exclusive Harms in Aligned Language Models
Viaarxiv icon

Persistent Pre-Training Poisoning of LLMs

Add code
Oct 17, 2024
Viaarxiv icon

Backtracking Improves Generation Safety

Add code
Sep 22, 2024
Figure 1 for Backtracking Improves Generation Safety
Figure 2 for Backtracking Improves Generation Safety
Figure 3 for Backtracking Improves Generation Safety
Figure 4 for Backtracking Improves Generation Safety
Viaarxiv icon

The Llama 3 Herd of Models

Add code
Jul 31, 2024
Viaarxiv icon

Towards Safety and Helpfulness Balanced Responses via Controllable Large Language Models

Add code
Apr 01, 2024
Viaarxiv icon

ROBBIE: Robust Bias Evaluation of Large Generative Language Models

Add code
Nov 29, 2023
Figure 1 for ROBBIE: Robust Bias Evaluation of Large Generative Language Models
Figure 2 for ROBBIE: Robust Bias Evaluation of Large Generative Language Models
Figure 3 for ROBBIE: Robust Bias Evaluation of Large Generative Language Models
Figure 4 for ROBBIE: Robust Bias Evaluation of Large Generative Language Models
Viaarxiv icon

The Gender-GAP Pipeline: A Gender-Aware Polyglot Pipeline for Gender Characterisation in 55 Languages

Add code
Aug 31, 2023
Viaarxiv icon

Llama 2: Open Foundation and Fine-Tuned Chat Models

Add code
Jul 19, 2023
Figure 1 for Llama 2: Open Foundation and Fine-Tuned Chat Models
Figure 2 for Llama 2: Open Foundation and Fine-Tuned Chat Models
Figure 3 for Llama 2: Open Foundation and Fine-Tuned Chat Models
Figure 4 for Llama 2: Open Foundation and Fine-Tuned Chat Models
Viaarxiv icon

Improving Open Language Models by Learning from Organic Interactions

Add code
Jun 07, 2023
Viaarxiv icon

BlenderBot 3: a deployed conversational agent that continually learns to responsibly engage

Add code
Aug 10, 2022
Figure 1 for BlenderBot 3: a deployed conversational agent that continually learns to responsibly engage
Figure 2 for BlenderBot 3: a deployed conversational agent that continually learns to responsibly engage
Figure 3 for BlenderBot 3: a deployed conversational agent that continually learns to responsibly engage
Figure 4 for BlenderBot 3: a deployed conversational agent that continually learns to responsibly engage
Viaarxiv icon