Picture for Avia Efrat

Avia Efrat

Shammie

ZeroSCROLLS: A Zero-Shot Benchmark for Long Text Understanding

Add code
May 23, 2023
Figure 1 for ZeroSCROLLS: A Zero-Shot Benchmark for Long Text Understanding
Figure 2 for ZeroSCROLLS: A Zero-Shot Benchmark for Long Text Understanding
Figure 3 for ZeroSCROLLS: A Zero-Shot Benchmark for Long Text Understanding
Figure 4 for ZeroSCROLLS: A Zero-Shot Benchmark for Long Text Understanding
Viaarxiv icon

LIMA: Less Is More for Alignment

Add code
May 18, 2023
Viaarxiv icon

LMentry: A Language Model Benchmark of Elementary Language Tasks

Add code
Nov 03, 2022
Viaarxiv icon

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Add code
Jun 10, 2022
Viaarxiv icon

SCROLLS: Standardized CompaRison Over Long Language Sequences

Add code
Jan 10, 2022
Figure 1 for SCROLLS: Standardized CompaRison Over Long Language Sequences
Figure 2 for SCROLLS: Standardized CompaRison Over Long Language Sequences
Figure 3 for SCROLLS: Standardized CompaRison Over Long Language Sequences
Figure 4 for SCROLLS: Standardized CompaRison Over Long Language Sequences
Viaarxiv icon

How Optimal is Greedy Decoding for Extractive Question Answering?

Add code
Aug 12, 2021
Figure 1 for How Optimal is Greedy Decoding for Extractive Question Answering?
Figure 2 for How Optimal is Greedy Decoding for Extractive Question Answering?
Figure 3 for How Optimal is Greedy Decoding for Extractive Question Answering?
Figure 4 for How Optimal is Greedy Decoding for Extractive Question Answering?
Viaarxiv icon

Cryptonite: A Cryptic Crossword Benchmark for Extreme Ambiguity in Language

Add code
Mar 01, 2021
Figure 1 for Cryptonite: A Cryptic Crossword Benchmark for Extreme Ambiguity in Language
Figure 2 for Cryptonite: A Cryptic Crossword Benchmark for Extreme Ambiguity in Language
Figure 3 for Cryptonite: A Cryptic Crossword Benchmark for Extreme Ambiguity in Language
Figure 4 for Cryptonite: A Cryptic Crossword Benchmark for Extreme Ambiguity in Language
Viaarxiv icon

The Turking Test: Can Language Models Understand Instructions?

Add code
Oct 22, 2020
Figure 1 for The Turking Test: Can Language Models Understand Instructions?
Figure 2 for The Turking Test: Can Language Models Understand Instructions?
Figure 3 for The Turking Test: Can Language Models Understand Instructions?
Figure 4 for The Turking Test: Can Language Models Understand Instructions?
Viaarxiv icon

Tag-based Multi-Span Extraction in Reading Comprehension

Add code
Oct 02, 2019
Figure 1 for Tag-based Multi-Span Extraction in Reading Comprehension
Figure 2 for Tag-based Multi-Span Extraction in Reading Comprehension
Figure 3 for Tag-based Multi-Span Extraction in Reading Comprehension
Figure 4 for Tag-based Multi-Span Extraction in Reading Comprehension
Viaarxiv icon