Picture for Michael Krumdick

Michael Krumdick

No Free Labels: Limitations of LLM-as-a-Judge Without Human Grounding

Add code
Mar 07, 2025
Viaarxiv icon

Are Language Model Logits Calibrated?

Add code
Oct 21, 2024
Viaarxiv icon

SEC-QA: A Systematic Evaluation Corpus for Financial QA

Add code
Jun 20, 2024
Viaarxiv icon

An Analysis of Multilingual FActScore

Add code
Jun 20, 2024
Viaarxiv icon

BizBench: A Quantitative Reasoning Benchmark for Business and Finance

Add code
Nov 11, 2023
Viaarxiv icon

A Graphical Approach to Document Layout Analysis

Add code
Aug 03, 2023
Viaarxiv icon

APRICOT: A Dataset of Physical Adversarial Attacks on Object Detection

Add code
Dec 17, 2019
Figure 1 for APRICOT: A Dataset of Physical Adversarial Attacks on Object Detection
Figure 2 for APRICOT: A Dataset of Physical Adversarial Attacks on Object Detection
Figure 3 for APRICOT: A Dataset of Physical Adversarial Attacks on Object Detection
Figure 4 for APRICOT: A Dataset of Physical Adversarial Attacks on Object Detection
Viaarxiv icon