Picture for Robert West

Robert West

Localized Cultural Knowledge is Conserved and Controllable in Large Language Models

Add code
Apr 14, 2025
Viaarxiv icon

Controlling Latent Diffusion Using Latent CLIP

Add code
Mar 11, 2025
Viaarxiv icon

Generating Structured Outputs from Language Models: Benchmark and Studies

Add code
Jan 18, 2025
Viaarxiv icon

Assessing Social Alignment: Do Personality-Prompted Large Language Models Behave Like Humans?

Add code
Dec 21, 2024
Figure 1 for Assessing Social Alignment: Do Personality-Prompted Large Language Models Behave Like Humans?
Figure 2 for Assessing Social Alignment: Do Personality-Prompted Large Language Models Behave Like Humans?
Figure 3 for Assessing Social Alignment: Do Personality-Prompted Large Language Models Behave Like Humans?
Figure 4 for Assessing Social Alignment: Do Personality-Prompted Large Language Models Behave Like Humans?
Viaarxiv icon

Byte BPE Tokenization as an Inverse string Homomorphism

Add code
Dec 04, 2024
Figure 1 for Byte BPE Tokenization as an Inverse string Homomorphism
Figure 2 for Byte BPE Tokenization as an Inverse string Homomorphism
Figure 3 for Byte BPE Tokenization as an Inverse string Homomorphism
Figure 4 for Byte BPE Tokenization as an Inverse string Homomorphism
Viaarxiv icon

Separating Tongue from Thought: Activation Patching Reveals Language-Agnostic Concept Representations in Transformers

Add code
Nov 13, 2024
Figure 1 for Separating Tongue from Thought: Activation Patching Reveals Language-Agnostic Concept Representations in Transformers
Figure 2 for Separating Tongue from Thought: Activation Patching Reveals Language-Agnostic Concept Representations in Transformers
Figure 3 for Separating Tongue from Thought: Activation Patching Reveals Language-Agnostic Concept Representations in Transformers
Figure 4 for Separating Tongue from Thought: Activation Patching Reveals Language-Agnostic Concept Representations in Transformers
Viaarxiv icon

Controllable Context Sensitivity and the Knob Behind It

Add code
Nov 11, 2024
Figure 1 for Controllable Context Sensitivity and the Knob Behind It
Figure 2 for Controllable Context Sensitivity and the Knob Behind It
Figure 3 for Controllable Context Sensitivity and the Knob Behind It
Figure 4 for Controllable Context Sensitivity and the Knob Behind It
Viaarxiv icon

Magentic-One: A Generalist Multi-Agent System for Solving Complex Tasks

Add code
Nov 07, 2024
Viaarxiv icon

Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders

Add code
Oct 28, 2024
Figure 1 for Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders
Figure 2 for Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders
Figure 3 for Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders
Figure 4 for Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders
Viaarxiv icon

Activation Scaling for Steering and Interpreting Language Models

Add code
Oct 07, 2024
Viaarxiv icon