Kevin Du

Controllable Context Sensitivity and the Knob Behind It

Nov 11, 2024

Efficiently Computing Susceptibility to Context in Language Models

Oct 18, 2024

Activation Scaling for Steering and Interpreting Language Models

Oct 07, 2024

Context versus Prior Knowledge in Language Models

Apr 06, 2024

Grammatical Gender's Influence on Distributional Semantics: A Causal Perspective

Nov 30, 2023

Generalizing Backpropagation for Gradient-Based Interpretability

Jul 06, 2023

AlphaSnake: Policy Iteration on a Nondeterministic NP-hard Markov Decision Process

Nov 17, 2022