Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Lauren Rhue

One world, one opinion? The superstar effect in LLM responses

Dec 13, 2024

Sofie Goethals, Lauren Rhue

Figure 1 for One world, one opinion? The superstar effect in LLM responses

Figure 2 for One world, one opinion? The superstar effect in LLM responses

Figure 3 for One world, one opinion? The superstar effect in LLM responses

Figure 4 for One world, one opinion? The superstar effect in LLM responses

Abstract:As large language models (LLMs) are shaping the way information is shared and accessed online, their opinions have the potential to influence a wide audience. This study examines who the LLMs view as the most prominent figures across various fields, using prompts in ten different languages to explore the influence of linguistic diversity. Our findings reveal low diversity in responses, with a small number of figures dominating recognition across languages (also known as the "superstar effect"). These results highlight the risk of narrowing global knowledge representation when LLMs retrieve subjective information.

Via

Access Paper or Ask Questions

Evaluating LLMs for Gender Disparities in Notable Persons

Mar 14, 2024

Lauren Rhue, Sofie Goethals, Arun Sundararajan

Figure 1 for Evaluating LLMs for Gender Disparities in Notable Persons

Figure 2 for Evaluating LLMs for Gender Disparities in Notable Persons

Figure 3 for Evaluating LLMs for Gender Disparities in Notable Persons

Figure 4 for Evaluating LLMs for Gender Disparities in Notable Persons

Abstract:This study examines the use of Large Language Models (LLMs) for retrieving factual information, addressing concerns over their propensity to produce factually incorrect "hallucinated" responses or to altogether decline to even answer prompt at all. Specifically, it investigates the presence of gender-based biases in LLMs' responses to factual inquiries. This paper takes a multi-pronged approach to evaluating GPT models by evaluating fairness across multiple dimensions of recall, hallucinations and declinations. Our findings reveal discernible gender disparities in the responses generated by GPT-3.5. While advancements in GPT-4 have led to improvements in performance, they have not fully eradicated these gender disparities, notably in instances where responses are declined. The study further explores the origins of these disparities by examining the influence of gender associations in prompts and the homogeneity in the responses.

Via

Access Paper or Ask Questions