GroUSE: A Benchmark to Evaluate Evaluators in Grounded Question Answering

Add code
Sep 10, 2024
Figure 1 for GroUSE: A Benchmark to Evaluate Evaluators in Grounded Question Answering
Figure 2 for GroUSE: A Benchmark to Evaluate Evaluators in Grounded Question Answering
Figure 3 for GroUSE: A Benchmark to Evaluate Evaluators in Grounded Question Answering
Figure 4 for GroUSE: A Benchmark to Evaluate Evaluators in Grounded Question Answering

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: