Ruosen Li

IQA-EVAL: Automatic Evaluation of Human-Model Interactive Question Answering

Aug 24, 2024

Leveraging Structured Information for Explainable Multi-hop Question Answering and Reasoning

Nov 07, 2023

FAITHSCORE: Evaluating Hallucinations in Large Vision-Language Models

Nov 02, 2023

PRD: Peer Rank and Discussion Improve Large Language Model based Evaluations

Jul 06, 2023