Picture for Daniel G. Brown

Daniel G. Brown

Assessing Language Models' Worldview for Fiction Generation

Add code
Aug 15, 2024
Viaarxiv icon

TruthEval: A Dataset to Evaluate LLM Truthfulness and Reliability

Add code
Jun 04, 2024
Viaarxiv icon

A Study on Large Language Models' Limitations in Multiple-Choice Question Answering

Add code
Jan 15, 2024
Viaarxiv icon

Reliability Check: An Analysis of GPT-3's Response to Sensitive Topics and Prompt Wording

Add code
Jun 09, 2023
Viaarxiv icon

Crowd Score: A Method for the Evaluation of Jokes using Large Language Model AI Voters as Judges

Add code
Dec 21, 2022
Viaarxiv icon