Picture for Byron C. Wallace

Byron C. Wallace

Do Automatic Factuality Metrics Measure Factuality? A Critical Evaluation

Add code
Nov 26, 2024
Viaarxiv icon

Open (Clinical) LLMs are Sensitive to Instruction Phrasings

Add code
Jul 12, 2024
Viaarxiv icon

Detection and Measurement of Syntactic Templates in Generated Text

Add code
Jun 28, 2024
Figure 1 for Detection and Measurement of Syntactic Templates in Generated Text
Figure 2 for Detection and Measurement of Syntactic Templates in Generated Text
Figure 3 for Detection and Measurement of Syntactic Templates in Generated Text
Figure 4 for Detection and Measurement of Syntactic Templates in Generated Text
Viaarxiv icon

Investigating Mysteries of CoT-Augmented Distillation

Add code
Jun 20, 2024
Viaarxiv icon

Learning from Natural Language Explanations for Generalizable Entity Matching

Add code
Jun 13, 2024
Viaarxiv icon

Automatically Extracting Numerical Results from Randomized Controlled Trials with Large Language Models

Add code
May 02, 2024
Viaarxiv icon

Standardizing the Measurement of Text Diversity: A Tool and a Comparative Analysis of Scores

Add code
Mar 01, 2024
Viaarxiv icon

How Much Annotation is Needed to Compare Summarization Models?

Add code
Feb 28, 2024
Viaarxiv icon

Leveraging ChatGPT in Pharmacovigilance Event Extraction: An Empirical Study

Add code
Feb 24, 2024
Viaarxiv icon

GenAudit: Fixing Factual Errors in Language Model Outputs with Evidence

Add code
Feb 19, 2024
Figure 1 for GenAudit: Fixing Factual Errors in Language Model Outputs with Evidence
Figure 2 for GenAudit: Fixing Factual Errors in Language Model Outputs with Evidence
Figure 3 for GenAudit: Fixing Factual Errors in Language Model Outputs with Evidence
Figure 4 for GenAudit: Fixing Factual Errors in Language Model Outputs with Evidence
Viaarxiv icon