Adrian de Wynter

If Eleanor Rigby Had Met ChatGPT: A Study on Loneliness in a Post-LLM World

Dec 02, 2024
Via arXiv

One Language, Many Gaps: Evaluating Dialect Fairness and Robustness of Large Language Models in Reasoning Tasks

Oct 14, 2024
Via arXiv

Awes, Laws, and Flaws From Today's LLM Research

Aug 29, 2024
Via arXiv

RTP-LX: Can LLMs Evaluate Toxicity in Multilingual Scenarios?

Apr 22, 2024
Via arXiv

LLM as a Mastermind: A Survey of Strategic Reasoning with Large Language Models

Apr 01, 2024
Via arXiv

Will GPT-4 Run DOOM?

Mar 08, 2024
Via arXiv

On Meta-Prompting

Dec 11, 2023
Via arXiv

I Wish to Have an Argument: Argumentative Reasoning in Large Language Models

Sep 29, 2023
Via arXiv

Are Large Language Model-based Evaluators the Solution to Scaling Up Multilingual Evaluation?

Sep 14, 2023
Via arXiv

A User-Centered Evaluation of Spanish Text Simplification

Aug 15, 2023
Via arXiv