Picture for Laura Perez-Beltrachini

Laura Perez-Beltrachini

Uncertainty Quantification in Retrieval Augmented Question Answering

Add code
Feb 25, 2025
Viaarxiv icon

Leveraging Entailment Judgements in Cross-Lingual Summarisation

Add code
Aug 01, 2024
Figure 1 for Leveraging Entailment Judgements in Cross-Lingual Summarisation
Figure 2 for Leveraging Entailment Judgements in Cross-Lingual Summarisation
Figure 3 for Leveraging Entailment Judgements in Cross-Lingual Summarisation
Figure 4 for Leveraging Entailment Judgements in Cross-Lingual Summarisation
Viaarxiv icon

The Hallucinations Leaderboard -- An Open Effort to Measure Hallucinations in Large Language Models

Add code
Apr 08, 2024
Viaarxiv icon

Fine-Grained Natural Language Inference Based Faithfulness Evaluation for Diverse Summarisation Tasks

Add code
Feb 27, 2024
Viaarxiv icon

Improving User Controlled Table-To-Text Generation Robustness

Add code
Feb 20, 2023
Viaarxiv icon

Semantic Parsing for Conversational Question Answering over Knowledge Graphs

Add code
Jan 28, 2023
Viaarxiv icon

GEMv2: Multilingual NLG Benchmarking in a Single Line of Code

Add code
Jun 24, 2022
Figure 1 for GEMv2: Multilingual NLG Benchmarking in a Single Line of Code
Figure 2 for GEMv2: Multilingual NLG Benchmarking in a Single Line of Code
Figure 3 for GEMv2: Multilingual NLG Benchmarking in a Single Line of Code
Figure 4 for GEMv2: Multilingual NLG Benchmarking in a Single Line of Code
Viaarxiv icon

Models and Datasets for Cross-Lingual Summarisation

Add code
Feb 19, 2022
Figure 1 for Models and Datasets for Cross-Lingual Summarisation
Figure 2 for Models and Datasets for Cross-Lingual Summarisation
Figure 3 for Models and Datasets for Cross-Lingual Summarisation
Figure 4 for Models and Datasets for Cross-Lingual Summarisation
Viaarxiv icon

Automatic Construction of Evaluation Suites for Natural Language Generation Datasets

Add code
Jun 16, 2021
Figure 1 for Automatic Construction of Evaluation Suites for Natural Language Generation Datasets
Figure 2 for Automatic Construction of Evaluation Suites for Natural Language Generation Datasets
Figure 3 for Automatic Construction of Evaluation Suites for Natural Language Generation Datasets
Figure 4 for Automatic Construction of Evaluation Suites for Natural Language Generation Datasets
Viaarxiv icon

The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics

Add code
Feb 03, 2021
Figure 1 for The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics
Figure 2 for The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics
Figure 3 for The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics
Figure 4 for The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics
Viaarxiv icon