Mario Giulianelli

Shammie

Establishing Best Practices for Building Rigorous Agentic Benchmarks

Jul 03, 2025
Via arXiv

Language Models over Canonical Byte-Pair Encodings

Jun 09, 2025
Via arXiv

Information Locality as an Inductive Bias for Neural Language Models

Jun 05, 2025
Via arXiv

The Harmonic Structure of Information Contours

Jun 04, 2025
Via arXiv

Playpen: An Environment for Exploring Learning Through Conversational Interaction

Apr 11, 2025
Via arXiv

Triangulating LLM Progress through Benchmarks, Games, and Cognitive Tests

Feb 20, 2025
Via arXiv

From Language Models over Tokens to Language Models over Characters

Dec 04, 2024
Via arXiv

Towards a Similarity-adjusted Surprisal Theory

Oct 23, 2024
Via arXiv

Surprise! Uniform Information Density Isn't the Whole Story: Predicting Surprisal Contours in Long-form Discourse

Oct 21, 2024
Via arXiv

On the Proper Treatment of Tokenization in Psycholinguistics

Oct 03, 2024
Via arXiv