Laura Dietz

Supporting Humans in Evaluating AI Summaries of Legal Depositions

Jan 21, 2026 (via arXiv)

Insider Knowledge: How Much Can RAG Systems Gain from Evaluation Secrets?

Jan 19, 2026 (via arXiv)

Incorporating Q&A Nuggets into Retrieval-Augmented Generation

Jan 19, 2026 (via arXiv)

UNH at CheckThat! 2025: Fine-tuning Vs Prompting in Claim Extraction

Sep 08, 2025 (via arXiv)

LLM-Evaluation Tropes: Perspectives on the Validity of LLM-Evaluations

Apr 27, 2025 (via arXiv)

LLM-based relevance assessment still can't replace human relevance assessment

Dec 22, 2024 (via arXiv)

Best in Tau@LLMJudge: Criteria-Based Relevance Evaluation with Llama3

Oct 17, 2024 (via arXiv)

A Workbench for Autograding Retrieve/Generate Systems

May 21, 2024 (via arXiv)

An Exam-based Evaluation Approach Beyond Traditional Relevance Judgments

Feb 01, 2024 (via arXiv)

Fine-grained Forecasting Models Via Gaussian Process Blurring Effect

Dec 21, 2023 (via arXiv)