Picture for Leshem Choshen

Leshem Choshen

Can Gradient Descent Simulate Prompting?

Add code
Jun 26, 2025
Viaarxiv icon

TextArena

Add code
Apr 15, 2025
Viaarxiv icon

Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora

Add code
Apr 10, 2025
Viaarxiv icon

Pretraining Language Models for Diachronic Linguistic Change Discovery

Add code
Apr 09, 2025
Viaarxiv icon

DOVE: A Large-Scale Multi-Dimensional Predictions Dataset Towards Meaningful LLM Evaluation

Add code
Mar 04, 2025
Viaarxiv icon

The Mighty ToRR: A Benchmark for Table Reasoning and Robustness

Add code
Feb 26, 2025
Viaarxiv icon

Sloth: scaling laws for LLM skills to predict multi-benchmark performance across families

Add code
Dec 09, 2024
Viaarxiv icon

Findings of the Second BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora

Add code
Dec 06, 2024
Viaarxiv icon

Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation

Add code
Dec 04, 2024
Figure 1 for Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation
Figure 2 for Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation
Figure 3 for Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation
Figure 4 for Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation
Viaarxiv icon

ZipNN: Lossless Compression for AI Models

Add code
Nov 07, 2024
Figure 1 for ZipNN: Lossless Compression for AI Models
Figure 2 for ZipNN: Lossless Compression for AI Models
Figure 3 for ZipNN: Lossless Compression for AI Models
Figure 4 for ZipNN: Lossless Compression for AI Models
Viaarxiv icon