Saffron Huang

Clio: Privacy-Preserving Insights into Real-World AI Use

Dec 18, 2024
Via arXiv

Collective Constitutional AI: Aligning a Language Model with Public Input

Jun 12, 2024
Via arXiv

Beyond static AI evaluations: advancing human interaction evaluations for LLM harms and risks

May 17, 2024
Via arXiv

Red Teaming Language Models with Language Models

Feb 07, 2022
Via arXiv

Improving language models by retrieving from trillions of tokens

Jan 11, 2022
Via arXiv

Scaling Language Models: Methods, Analysis & Insights from Training Gopher

Dec 08, 2021
Via arXiv