Picture for Claudia Shi

Claudia Shi

Hypothesis Testing the Circuit Hypothesis in LLMs

Add code
Oct 16, 2024
Viaarxiv icon

Causal-structure Driven Augmentations for Text OOD Generalization

Add code
Oct 19, 2023
Figure 1 for Causal-structure Driven Augmentations for Text OOD Generalization
Figure 2 for Causal-structure Driven Augmentations for Text OOD Generalization
Figure 3 for Causal-structure Driven Augmentations for Text OOD Generalization
Figure 4 for Causal-structure Driven Augmentations for Text OOD Generalization
Viaarxiv icon

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

Add code
Jul 27, 2023
Figure 1 for Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Figure 2 for Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Figure 3 for Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Figure 4 for Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Viaarxiv icon

Evaluating the Moral Beliefs Encoded in LLMs

Add code
Jul 26, 2023
Viaarxiv icon

An Invariant Learning Characterization of Controlled Text Generation

Add code
May 31, 2023
Viaarxiv icon

Invariant Representation Learning for Treatment Effect Estimation

Add code
Nov 24, 2020
Figure 1 for Invariant Representation Learning for Treatment Effect Estimation
Figure 2 for Invariant Representation Learning for Treatment Effect Estimation
Figure 3 for Invariant Representation Learning for Treatment Effect Estimation
Figure 4 for Invariant Representation Learning for Treatment Effect Estimation
Viaarxiv icon

Adapting Neural Networks for the Estimation of Treatment Effects

Add code
Jun 05, 2019
Figure 1 for Adapting Neural Networks for the Estimation of Treatment Effects
Figure 2 for Adapting Neural Networks for the Estimation of Treatment Effects
Figure 3 for Adapting Neural Networks for the Estimation of Treatment Effects
Figure 4 for Adapting Neural Networks for the Estimation of Treatment Effects
Viaarxiv icon