Luis Fernando D'Haro

Beyond Single-Audio: Advancing Multi-Audio Processing in Audio Large Language Models

Sep 27, 2024

CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark

Jun 10, 2024

Unveiling the Achilles' Heel of NLG Evaluators: A Unified Adversarial Framework Driven by Large Language Models

May 23, 2024

Awareness in robotics: An early perspective from the viewpoint of the EIC Pathfinder Challenge "Awareness Inside"

Feb 14, 2024

A Comprehensive Analysis of the Effectiveness of Large Language Models as Automatic Dialogue Evaluators

Dec 24, 2023

xDial-Eval: A Multilingual Open-Domain Dialogue Evaluation Benchmark

Oct 13, 2023

Overview of Robust and Multilingual Automatic Evaluation Metrics for Open-Domain Dialogue Systems at DSTC 11 Track 4

Jun 22, 2023

PoE: a Panel of Experts for Generalized Automatic Dialogue Assessment

Dec 18, 2022

FineD-Eval: Fine-grained Automatic Dialogue-Level Evaluation

Oct 29, 2022

Report from the NSF Future Directions Workshop on Automatic Evaluation of Dialog: Research Directions and Challenges

Mar 18, 2022