Title: SciRepEval: A Multi-Format Benchmark for Scientific Document Representations

Nov 23, 2022

Amanpreet Singh, Mike D'Arcy, Arman Cohan, Doug Downey, Sergey Feldman

Abstract: Learned representations of scientific documents can serve as valuable input features for downstream tasks, without the need for further fine-tuning. However, existing benchmarks for evaluating these representations fail to capture the diversity of relevant tasks. In response, we introduce SciRepEval, the first comprehensive benchmark for training and evaluating scientific document representations. It includes 25 challenging and realistic tasks, 11 of which are new, across four formats: classification, regression, ranking and search. We then use the benchmark to study and improve the generalization ability of scientific document representation models. We show how state-of-the-art models struggle to generalize across task formats, and that simple multi-task training fails to improve them. However, a new approach that learns multiple embeddings per document, each tailored to a different format, can improve performance. We experiment with task-format-specific control codes and adapters in a multi-task setting and find that they outperform the existing single-embedding state-of-the-art by up to 1.5 points absolute.

* 21 pages, 2 figures, 9 tables. For associated code, see https://github.com/allenai/scirepeval
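
As a rough illustration of the idea in the abstract (one embedding per document per task format, selected by a control code), here is a minimal Python sketch. It is not the authors' implementation: the control-code strings, the function names `toy_encode` and `embed_per_format`, and the hashed bag-of-words encoder are all hypothetical stand-ins for a trained model, chosen only so the example runs without external dependencies. For the actual code, see the repository linked above.

```python
import hashlib
import math
from typing import Dict, List

# The four task formats named in the abstract.
FORMATS = ("classification", "regression", "ranking", "search")

# Hypothetical control-code tokens, one per format (names are assumptions).
CONTROL_CODES = {fmt: "[" + fmt.upper() + "]" for fmt in FORMATS}


def toy_encode(tokens: List[str], dim: int = 16) -> List[float]:
    """Stand-in for a trained encoder (assumption, not a real model).

    Control-code tokens land in reserved slots at the front of the vector;
    every other token is hashed into the remaining slots. The result is
    L2-normalised.
    """
    reserved = len(FORMATS)
    code_slot = {code: i for i, code in enumerate(CONTROL_CODES.values())}
    vec = [0.0] * dim
    for token in tokens:
        if token in code_slot:
            vec[code_slot[token]] += 1.0
        else:
            digest = hashlib.sha256(token.lower().encode("utf-8")).digest()
            vec[reserved + digest[0] % (dim - reserved)] += 1.0
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm else vec


def embed_per_format(document: str, dim: int = 16) -> Dict[str, List[float]]:
    """Return one embedding per task format by prepending its control code."""
    words = document.split()
    return {
        fmt: toy_encode([code] + words, dim=dim)
        for fmt, code in CONTROL_CODES.items()
    }


if __name__ == "__main__":
    embeddings = embed_per_format("Learned representations of scientific documents")
    for fmt, vector in embeddings.items():
        print(fmt, [round(x, 3) for x in vector[:6]])
```

In a real system the encoder would be a trained language model and the control code would be a special token in its vocabulary; the sketch only shows the input-side mechanism of conditioning one shared encoder on the task format.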

View paper on OpenReview
