Picture for Robert West

Robert West

Generating Structured Outputs from Language Models: Benchmark and Studies

Add code
Jan 18, 2025
Viaarxiv icon

Assessing Social Alignment: Do Personality-Prompted Large Language Models Behave Like Humans?

Add code
Dec 21, 2024
Figure 1 for Assessing Social Alignment: Do Personality-Prompted Large Language Models Behave Like Humans?
Figure 2 for Assessing Social Alignment: Do Personality-Prompted Large Language Models Behave Like Humans?
Figure 3 for Assessing Social Alignment: Do Personality-Prompted Large Language Models Behave Like Humans?
Figure 4 for Assessing Social Alignment: Do Personality-Prompted Large Language Models Behave Like Humans?
Viaarxiv icon

Byte BPE Tokenization as an Inverse string Homomorphism

Add code
Dec 04, 2024
Figure 1 for Byte BPE Tokenization as an Inverse string Homomorphism
Figure 2 for Byte BPE Tokenization as an Inverse string Homomorphism
Figure 3 for Byte BPE Tokenization as an Inverse string Homomorphism
Figure 4 for Byte BPE Tokenization as an Inverse string Homomorphism
Viaarxiv icon

Separating Tongue from Thought: Activation Patching Reveals Language-Agnostic Concept Representations in Transformers

Add code
Nov 13, 2024
Figure 1 for Separating Tongue from Thought: Activation Patching Reveals Language-Agnostic Concept Representations in Transformers
Figure 2 for Separating Tongue from Thought: Activation Patching Reveals Language-Agnostic Concept Representations in Transformers
Figure 3 for Separating Tongue from Thought: Activation Patching Reveals Language-Agnostic Concept Representations in Transformers
Figure 4 for Separating Tongue from Thought: Activation Patching Reveals Language-Agnostic Concept Representations in Transformers
Viaarxiv icon

Controllable Context Sensitivity and the Knob Behind It

Add code
Nov 11, 2024
Figure 1 for Controllable Context Sensitivity and the Knob Behind It
Figure 2 for Controllable Context Sensitivity and the Knob Behind It
Figure 3 for Controllable Context Sensitivity and the Knob Behind It
Figure 4 for Controllable Context Sensitivity and the Knob Behind It
Viaarxiv icon

Magentic-One: A Generalist Multi-Agent System for Solving Complex Tasks

Add code
Nov 07, 2024
Viaarxiv icon

Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders

Add code
Oct 28, 2024
Figure 1 for Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders
Figure 2 for Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders
Figure 3 for Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders
Figure 4 for Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders
Viaarxiv icon

Activation Scaling for Steering and Interpreting Language Models

Add code
Oct 07, 2024
Viaarxiv icon

Entity Insertion in Multilingual Linked Corpora: The Case of Wikipedia

Add code
Oct 05, 2024
Figure 1 for Entity Insertion in Multilingual Linked Corpora: The Case of Wikipedia
Figure 2 for Entity Insertion in Multilingual Linked Corpora: The Case of Wikipedia
Figure 3 for Entity Insertion in Multilingual Linked Corpora: The Case of Wikipedia
Figure 4 for Entity Insertion in Multilingual Linked Corpora: The Case of Wikipedia
Viaarxiv icon

Could ChatGPT get an Engineering Degree? Evaluating Higher Education Vulnerability to AI Assistants

Add code
Aug 07, 2024
Figure 1 for Could ChatGPT get an Engineering Degree? Evaluating Higher Education Vulnerability to AI Assistants
Figure 2 for Could ChatGPT get an Engineering Degree? Evaluating Higher Education Vulnerability to AI Assistants
Figure 3 for Could ChatGPT get an Engineering Degree? Evaluating Higher Education Vulnerability to AI Assistants
Figure 4 for Could ChatGPT get an Engineering Degree? Evaluating Higher Education Vulnerability to AI Assistants
Viaarxiv icon