Picture for Zongxia Li

Zongxia Li

SciDoc2Diagrammer-MAF: Towards Generation of Scientific Diagrams from Documents guided by Multi-Aspect Feedback Refinement

Add code
Sep 28, 2024
Viaarxiv icon

Do Large Language Models Discriminate in Hiring Decisions on the Basis of Race, Ethnicity, and Gender?

Add code
Jun 15, 2024
Viaarxiv icon

PANDA (Pedantic ANswer-correctness Determination and Adjudication):Improving Automatic Evaluation for Question Answering and Text Generation

Add code
Feb 17, 2024
Viaarxiv icon

Beyond Automated Evaluation Metrics: Evaluating Topic Models On Practical Social Science Content Analysis Tasks

Add code
Jan 29, 2024
Viaarxiv icon

CFMatch: Aligning Automated Answer Equivalence Evaluation with Expert Judgments For Open-Domain Question Answering

Add code
Jan 24, 2024
Viaarxiv icon

HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V, LLaVA-1.5, and Other Multi-modality Models

Add code
Oct 23, 2023
Viaarxiv icon

Towards Understanding In-Context Learning with Contrastive Demonstrations and Saliency Maps

Add code
Jul 11, 2023
Viaarxiv icon

SODAPOP: Open-Ended Discovery of Social Biases in Social Commonsense Reasoning Models

Add code
Oct 13, 2022
Figure 1 for SODAPOP: Open-Ended Discovery of Social Biases in Social Commonsense Reasoning Models
Figure 2 for SODAPOP: Open-Ended Discovery of Social Biases in Social Commonsense Reasoning Models
Figure 3 for SODAPOP: Open-Ended Discovery of Social Biases in Social Commonsense Reasoning Models
Figure 4 for SODAPOP: Open-Ended Discovery of Social Biases in Social Commonsense Reasoning Models
Viaarxiv icon