Haoyi Qiu

Evaluating Cultural and Social Awareness of LLM Web Agents

Oct 30, 2024

VALOR-EVAL: Holistic Coverage and Faithfulness Evaluation of Large Vision-Language Models

Apr 22, 2024

From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Models

Mar 25, 2024

New Job, New Gender? Measuring the Social Bias in Image Generation Models

Jan 01, 2024

AMRFact: Enhancing Summarization Factuality Evaluation with AMR-driven Training Data Generation

Nov 16, 2023

Gender Biases in Automatic Evaluation Metrics: A Case Study on Image Captioning

May 24, 2023