Picture for Rotem Dror

Rotem Dror

State of What Art? A Call for Multi-Prompt LLM Evaluation

Add code
Dec 31, 2023
Viaarxiv icon

DMLR: Data-centric Machine Learning Research -- Past, Present and Future

Add code
Nov 21, 2023
Figure 1 for DMLR: Data-centric Machine Learning Research -- Past, Present and Future
Figure 2 for DMLR: Data-centric Machine Learning Research -- Past, Present and Future
Figure 3 for DMLR: Data-centric Machine Learning Research -- Past, Present and Future
Viaarxiv icon

The Eval4NLP 2023 Shared Task on Prompting Large Language Models as Explainable Metrics

Add code
Oct 30, 2023
Figure 1 for The Eval4NLP 2023 Shared Task on Prompting Large Language Models as Explainable Metrics
Figure 2 for The Eval4NLP 2023 Shared Task on Prompting Large Language Models as Explainable Metrics
Figure 3 for The Eval4NLP 2023 Shared Task on Prompting Large Language Models as Explainable Metrics
Figure 4 for The Eval4NLP 2023 Shared Task on Prompting Large Language Models as Explainable Metrics
Viaarxiv icon

Human-in-the-Loop Schema Induction

Add code
Feb 25, 2023
Viaarxiv icon

On the Limitations of Reference-Free Evaluations of Generated Text

Add code
Oct 22, 2022
Viaarxiv icon

Zero-Shot On-the-Fly Event Schema Induction

Add code
Oct 12, 2022
Figure 1 for Zero-Shot On-the-Fly Event Schema Induction
Figure 2 for Zero-Shot On-the-Fly Event Schema Induction
Figure 3 for Zero-Shot On-the-Fly Event Schema Induction
Figure 4 for Zero-Shot On-the-Fly Event Schema Induction
Viaarxiv icon

Re-Examining System-Level Correlations of Automatic Summarization Evaluation Metrics

Add code
Apr 21, 2022
Figure 1 for Re-Examining System-Level Correlations of Automatic Summarization Evaluation Metrics
Figure 2 for Re-Examining System-Level Correlations of Automatic Summarization Evaluation Metrics
Figure 3 for Re-Examining System-Level Correlations of Automatic Summarization Evaluation Metrics
Figure 4 for Re-Examining System-Level Correlations of Automatic Summarization Evaluation Metrics
Viaarxiv icon

A Statistical Analysis of Summarization Evaluation Metrics using Resampling Methods

Add code
Mar 31, 2021
Figure 1 for A Statistical Analysis of Summarization Evaluation Metrics using Resampling Methods
Figure 2 for A Statistical Analysis of Summarization Evaluation Metrics using Resampling Methods
Figure 3 for A Statistical Analysis of Summarization Evaluation Metrics using Resampling Methods
Figure 4 for A Statistical Analysis of Summarization Evaluation Metrics using Resampling Methods
Viaarxiv icon

The Structured Weighted Violations MIRA

Add code
May 09, 2020
Figure 1 for The Structured Weighted Violations MIRA
Figure 2 for The Structured Weighted Violations MIRA
Figure 3 for The Structured Weighted Violations MIRA
Figure 4 for The Structured Weighted Violations MIRA
Viaarxiv icon

Appendix - Recommended Statistical Significance Tests for NLP Tasks

Add code
Sep 05, 2018
Viaarxiv icon