Picture for Rachel Rudinger

Rachel Rudinger

Natural Language Inference Improves Compositionality in Vision-Language Models

Add code
Oct 29, 2024
Figure 1 for Natural Language Inference Improves Compositionality in Vision-Language Models
Figure 2 for Natural Language Inference Improves Compositionality in Vision-Language Models
Figure 3 for Natural Language Inference Improves Compositionality in Vision-Language Models
Figure 4 for Natural Language Inference Improves Compositionality in Vision-Language Models
Viaarxiv icon

Susu Box or Piggy Bank: Assessing Cultural Commonsense Knowledge between Ghana and the U.S

Add code
Oct 23, 2024
Viaarxiv icon

Reverse Question Answering: Can an LLM Write a Question so Hard (or Bad) that it Can't Answer?

Add code
Oct 20, 2024
Viaarxiv icon

Plausibly Problematic Questions in Multiple-Choice Benchmarks for Commonsense Reasoning

Add code
Oct 06, 2024
Viaarxiv icon

On the Influence of Gender and Race in Romantic Relationship Prediction from Large Language Models

Add code
Oct 05, 2024
Figure 1 for On the Influence of Gender and Race in Romantic Relationship Prediction from Large Language Models
Figure 2 for On the Influence of Gender and Race in Romantic Relationship Prediction from Large Language Models
Figure 3 for On the Influence of Gender and Race in Romantic Relationship Prediction from Large Language Models
Figure 4 for On the Influence of Gender and Race in Romantic Relationship Prediction from Large Language Models
Viaarxiv icon

Is Your Large Language Model Knowledgeable or a Choices-Only Cheater?

Add code
Jul 02, 2024
Viaarxiv icon

Do Large Language Models Discriminate in Hiring Decisions on the Basis of Race, Ethnicity, and Gender?

Add code
Jun 15, 2024
Viaarxiv icon

How often are errors in natural language reasoning due to paraphrastic variability?

Add code
Apr 17, 2024
Viaarxiv icon

Artifacts or Abduction: How Do LLMs Answer Multiple-Choice Questions Without the Question?

Add code
Feb 19, 2024
Viaarxiv icon

Multilingual large language models leak human stereotypes across language boundaries

Add code
Dec 12, 2023
Viaarxiv icon