Dongryeol Lee

Judging Against the Reference: Uncovering Knowledge-Driven Failures in LLM-Judges on QA Evaluation

Jan 12, 2026
Via arXiv

Can You Trick the Grader? Adversarial Persuasion of LLM Judges

Aug 11, 2025
Via arXiv

Don't Judge Code by Its Cover: Exploring Biases in LLM Judges for Code Evaluation

May 22, 2025
Via arXiv

Fooling the LVLM Judges: Visual Biases in LVLM-Based Evaluation

May 21, 2025
Via arXiv

Generating Diverse Hypotheses for Inductive Reasoning

Dec 18, 2024
Via arXiv

Are LLM-Judges Robust to Expressions of Uncertainty? Investigating the effect of Epistemic Markers on LLM-based Evaluation

Oct 28, 2024
Via arXiv

VLind-Bench: Measuring Language Priors in Large Vision-Language Models

Jun 17, 2024
Via arXiv

Return of EM: Entity-driven Answer Set Expansion for QA Evaluation

Apr 24, 2024
Via arXiv

Asking Clarification Questions to Handle Ambiguity in Open-Domain QA

May 23, 2023
Via arXiv

Plug-and-play dual-tree algorithm runtime analysis

Jan 21, 2015
Via arXiv