Picture for Alberto Testoni

Alberto Testoni

LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks

Add code
Jun 26, 2024
Figure 1 for LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks
Figure 2 for LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks
Figure 3 for LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks
Figure 4 for LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks
Viaarxiv icon

Learning to Ask Informative Questions: Enhancing LLMs with Preference Optimization and Expected Information Gain

Add code
Jun 25, 2024
Viaarxiv icon

Naming, Describing, and Quantifying Visual Objects in Humans and LLMs

Add code
Mar 13, 2024
Viaarxiv icon

Asking the Right Question at the Right Time: Human and Model Uncertainty Guidance to Ask Clarification Questions

Add code
Feb 09, 2024
Viaarxiv icon

Are Current Decoding Strategies Capable of Facing the Challenges of Visual Dialogue?

Add code
Oct 24, 2022
Viaarxiv icon

Looking for Confirmations: An Effective and Human-Like Visual Dialogue Strategy

Add code
Sep 11, 2021
Figure 1 for Looking for Confirmations: An Effective and Human-Like Visual Dialogue Strategy
Figure 2 for Looking for Confirmations: An Effective and Human-Like Visual Dialogue Strategy
Figure 3 for Looking for Confirmations: An Effective and Human-Like Visual Dialogue Strategy
Figure 4 for Looking for Confirmations: An Effective and Human-Like Visual Dialogue Strategy
Viaarxiv icon

Overprotective Training Environments Fall Short at Testing Time: Let Models Contribute to Their Own Training

Add code
Mar 30, 2021
Figure 1 for Overprotective Training Environments Fall Short at Testing Time: Let Models Contribute to Their Own Training
Figure 2 for Overprotective Training Environments Fall Short at Testing Time: Let Models Contribute to Their Own Training
Figure 3 for Overprotective Training Environments Fall Short at Testing Time: Let Models Contribute to Their Own Training
Figure 4 for Overprotective Training Environments Fall Short at Testing Time: Let Models Contribute to Their Own Training
Viaarxiv icon

The Interplay of Task Success and Dialogue Quality: An in-depth Evaluation in Task-Oriented Visual Dialogues

Add code
Mar 20, 2021
Figure 1 for The Interplay of Task Success and Dialogue Quality: An in-depth Evaluation in Task-Oriented Visual Dialogues
Figure 2 for The Interplay of Task Success and Dialogue Quality: An in-depth Evaluation in Task-Oriented Visual Dialogues
Figure 3 for The Interplay of Task Success and Dialogue Quality: An in-depth Evaluation in Task-Oriented Visual Dialogues
Figure 4 for The Interplay of Task Success and Dialogue Quality: An in-depth Evaluation in Task-Oriented Visual Dialogues
Viaarxiv icon

Grounded Textual Entailment

Add code
Jun 14, 2018
Figure 1 for Grounded Textual Entailment
Figure 2 for Grounded Textual Entailment
Figure 3 for Grounded Textual Entailment
Figure 4 for Grounded Textual Entailment
Viaarxiv icon