Picture for Shibani Santurkar

Shibani Santurkar

Whose Opinions Do Language Models Reflect?

Add code
Mar 30, 2023
Viaarxiv icon

Data Selection for Language Models via Importance Resampling

Add code
Feb 06, 2023
Viaarxiv icon

Holistic Evaluation of Language Models

Add code
Nov 16, 2022
Figure 1 for Holistic Evaluation of Language Models
Figure 2 for Holistic Evaluation of Language Models
Figure 3 for Holistic Evaluation of Language Models
Figure 4 for Holistic Evaluation of Language Models
Viaarxiv icon

Is a Caption Worth a Thousand Images? A Controlled Study for Representation Learning

Add code
Jul 15, 2022
Figure 1 for Is a Caption Worth a Thousand Images? A Controlled Study for Representation Learning
Figure 2 for Is a Caption Worth a Thousand Images? A Controlled Study for Representation Learning
Figure 3 for Is a Caption Worth a Thousand Images? A Controlled Study for Representation Learning
Figure 4 for Is a Caption Worth a Thousand Images? A Controlled Study for Representation Learning
Viaarxiv icon

Editing a classifier by rewriting its prediction rules

Add code
Dec 02, 2021
Figure 1 for Editing a classifier by rewriting its prediction rules
Figure 2 for Editing a classifier by rewriting its prediction rules
Figure 3 for Editing a classifier by rewriting its prediction rules
Figure 4 for Editing a classifier by rewriting its prediction rules
Viaarxiv icon

3DB: A Framework for Debugging Computer Vision Models

Add code
Jun 07, 2021
Figure 1 for 3DB: A Framework for Debugging Computer Vision Models
Figure 2 for 3DB: A Framework for Debugging Computer Vision Models
Figure 3 for 3DB: A Framework for Debugging Computer Vision Models
Figure 4 for 3DB: A Framework for Debugging Computer Vision Models
Viaarxiv icon

Leveraging Sparse Linear Layers for Debuggable Deep Networks

Add code
May 11, 2021
Figure 1 for Leveraging Sparse Linear Layers for Debuggable Deep Networks
Figure 2 for Leveraging Sparse Linear Layers for Debuggable Deep Networks
Figure 3 for Leveraging Sparse Linear Layers for Debuggable Deep Networks
Figure 4 for Leveraging Sparse Linear Layers for Debuggable Deep Networks
Viaarxiv icon

BREEDS: Benchmarks for Subpopulation Shift

Add code
Aug 11, 2020
Figure 1 for BREEDS: Benchmarks for Subpopulation Shift
Figure 2 for BREEDS: Benchmarks for Subpopulation Shift
Figure 3 for BREEDS: Benchmarks for Subpopulation Shift
Figure 4 for BREEDS: Benchmarks for Subpopulation Shift
Viaarxiv icon

Implementation Matters in Deep Policy Gradients: A Case Study on PPO and TRPO

Add code
May 25, 2020
Figure 1 for Implementation Matters in Deep Policy Gradients: A Case Study on PPO and TRPO
Figure 2 for Implementation Matters in Deep Policy Gradients: A Case Study on PPO and TRPO
Figure 3 for Implementation Matters in Deep Policy Gradients: A Case Study on PPO and TRPO
Figure 4 for Implementation Matters in Deep Policy Gradients: A Case Study on PPO and TRPO
Viaarxiv icon

From ImageNet to Image Classification: Contextualizing Progress on Benchmarks

Add code
May 22, 2020
Figure 1 for From ImageNet to Image Classification: Contextualizing Progress on Benchmarks
Figure 2 for From ImageNet to Image Classification: Contextualizing Progress on Benchmarks
Figure 3 for From ImageNet to Image Classification: Contextualizing Progress on Benchmarks
Figure 4 for From ImageNet to Image Classification: Contextualizing Progress on Benchmarks
Viaarxiv icon