Picture for Fernanda Viégas

Fernanda Viégas

Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner

Add code
Jun 17, 2024
Figure 1 for Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner
Figure 2 for Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner
Figure 3 for Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner
Figure 4 for Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner
Viaarxiv icon

Designing a Dashboard for Transparency and Control of Conversational AI

Add code
Jun 12, 2024
Viaarxiv icon

Measuring and Controlling Persona Drift in Language Model Dialogs

Add code
Feb 13, 2024
Figure 1 for Measuring and Controlling Persona Drift in Language Model Dialogs
Figure 2 for Measuring and Controlling Persona Drift in Language Model Dialogs
Figure 3 for Measuring and Controlling Persona Drift in Language Model Dialogs
Figure 4 for Measuring and Controlling Persona Drift in Language Model Dialogs
Viaarxiv icon

Beyond Surface Statistics: Scene Representations in a Latent Diffusion Model

Add code
Jun 09, 2023
Viaarxiv icon

Inference-Time Intervention: Eliciting Truthful Answers from a Language Model

Add code
Jun 07, 2023
Viaarxiv icon

AttentionViz: A Global View of Transformer Attention

Add code
May 04, 2023
Viaarxiv icon

The System Model and the User Model: Exploring AI Dashboard Design

Add code
May 04, 2023
Viaarxiv icon

Emergent World Representations: Exploring a Sequence Model Trained on a Synthetic Task

Add code
Oct 25, 2022
Viaarxiv icon

An Interpretability Illusion for BERT

Add code
Apr 14, 2021
Figure 1 for An Interpretability Illusion for BERT
Figure 2 for An Interpretability Illusion for BERT
Figure 3 for An Interpretability Illusion for BERT
Figure 4 for An Interpretability Illusion for BERT
Viaarxiv icon

Segment Integrated Gradients: Better attributions through regions

Add code
Jun 06, 2019
Figure 1 for Segment Integrated Gradients: Better attributions through regions
Figure 2 for Segment Integrated Gradients: Better attributions through regions
Figure 3 for Segment Integrated Gradients: Better attributions through regions
Figure 4 for Segment Integrated Gradients: Better attributions through regions
Viaarxiv icon