Guillermo Marco

None of the Others: a General Technique to Distinguish Reasoning from Memorization in Multiple-Choice LLM Evaluation Benchmarks

Feb 18, 2025
Via arXiv

Small Language Models can Outperform Humans in Short Creative Writing: A Study Comparing SLMs with Humans and LLMs

Sep 17, 2024
Via arXiv

Pron vs Prompt: Can Large Language Models already Challenge a World-Class Fiction Author at Creative Text Writing?

Jul 01, 2024
Via arXiv