Picture for Gili Lior

Gili Lior

WildIFEval: Instruction Following in the Wild

Add code
Mar 09, 2025
Viaarxiv icon

WildFrame: Comparing Framing in Humans and LLMs on Naturally Occurring Texts

Add code
Feb 24, 2025
Viaarxiv icon

SEAM: A Stochastic Benchmark for Multi-Document Tasks

Add code
Jun 23, 2024
Figure 1 for SEAM: A Stochastic Benchmark for Multi-Document Tasks
Figure 2 for SEAM: A Stochastic Benchmark for Multi-Document Tasks
Figure 3 for SEAM: A Stochastic Benchmark for Multi-Document Tasks
Figure 4 for SEAM: A Stochastic Benchmark for Multi-Document Tasks
Viaarxiv icon

Leveraging Collection-Wide Similarities for Unsupervised Document Structure Extraction

Add code
Feb 21, 2024
Viaarxiv icon

Comparing Humans and Models on a Similar Scale: Towards Cognitive Gender Bias Evaluation in Coreference Resolution

Add code
May 24, 2023
Viaarxiv icon