Gonzalo Martínez

To Words and Beyond: Probing Large Language Models for Sentence-Level Psycholinguistic Norms of Memorability and Reading Times

Mar 12, 2026
Via arXiv

Adding LLMs to the psycholinguistic norming toolbox: A practical guide to getting the most out of human ratings

Sep 17, 2025
Via arXiv

The Generative Energy Arena (GEA): Incorporating Energy Awareness in Large Language Model (LLM) Human Evaluations

Jul 17, 2025
Via arXiv

La Leaderboard: A Large Language Model Leaderboard for Spanish Varieties and Languages of Spain and Latin America

Jul 01, 2025
Via arXiv

Can ChatGPT Learn to Count Letters?

Feb 23, 2025
Via arXiv

Understanding the Impact of Artificial Intelligence in Academic Writing: Metadata to the Rescue

Feb 23, 2025
Via arXiv

Multiple Choice Questions: Reasoning Makes Large Language Models (LLMs) More Self-Confident Even When They Are Wrong

Jan 16, 2025
Via arXiv

Open Source Conversational LLMs do not know most Spanish words

Mar 21, 2024
Via arXiv

Beware of Words: Evaluating the Lexical Richness of Conversational Large Language Models

Feb 11, 2024
Via arXiv

The continued usefulness of vocabulary tests for evaluating large language models

Oct 23, 2023
Via arXiv