Lucy Campbell-Gillingham

Fine-tuning language models to find agreement among humans with diverse preferences

Nov 28, 2022

Improving alignment of dialogue agents via targeted human judgements

Sep 28, 2022

Teaching language models to support answers with verified quotes

Mar 21, 2022

HCMD-zero: Learning Value Aligned Mechanisms from Data

Feb 21, 2022

Human-centered mechanism design with Democratic AI

Jan 27, 2022