Picture for Samuel Joseph Amouyal

Samuel Joseph Amouyal

GLEE: A Unified Framework and Benchmark for Language-based Economic Environments

Add code
Oct 07, 2024
Viaarxiv icon

AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?

Add code
Jul 22, 2024
Viaarxiv icon

Large Language Models for Psycholinguistic Plausibility Pretesting

Add code
Feb 08, 2024
Viaarxiv icon

QAMPARI: : An Open-domain Question Answering Benchmark for Questions with Many Answers from Multiple Paragraphs

Add code
May 26, 2022
Figure 1 for QAMPARI: : An Open-domain Question Answering Benchmark for Questions with Many Answers from Multiple Paragraphs
Figure 2 for QAMPARI: : An Open-domain Question Answering Benchmark for Questions with Many Answers from Multiple Paragraphs
Figure 3 for QAMPARI: : An Open-domain Question Answering Benchmark for Questions with Many Answers from Multiple Paragraphs
Figure 4 for QAMPARI: : An Open-domain Question Answering Benchmark for Questions with Many Answers from Multiple Paragraphs
Viaarxiv icon