Picture for Mark Dredze

Mark Dredze

DnDScore: Decontextualization and Decomposition for Factuality Verification in Long-Form Text Generation

Add code
Dec 17, 2024
Figure 1 for DnDScore: Decontextualization and Decomposition for Factuality Verification in Long-Form Text Generation
Figure 2 for DnDScore: Decontextualization and Decomposition for Factuality Verification in Long-Form Text Generation
Figure 3 for DnDScore: Decontextualization and Decomposition for Factuality Verification in Long-Form Text Generation
Figure 4 for DnDScore: Decontextualization and Decomposition for Factuality Verification in Long-Form Text Generation
Viaarxiv icon

Making FETCH! Happen: Finding Emergent Dog Whistles Through Common Habitats

Add code
Dec 16, 2024
Viaarxiv icon

Are Clinical T5 Models Better for Clinical Text?

Add code
Dec 08, 2024
Figure 1 for Are Clinical T5 Models Better for Clinical Text?
Figure 2 for Are Clinical T5 Models Better for Clinical Text?
Figure 3 for Are Clinical T5 Models Better for Clinical Text?
Figure 4 for Are Clinical T5 Models Better for Clinical Text?
Viaarxiv icon

Give me Some Hard Questions: Synthetic Data Generation for Clinical QA

Add code
Dec 05, 2024
Figure 1 for Give me Some Hard Questions: Synthetic Data Generation for Clinical QA
Figure 2 for Give me Some Hard Questions: Synthetic Data Generation for Clinical QA
Figure 3 for Give me Some Hard Questions: Synthetic Data Generation for Clinical QA
Figure 4 for Give me Some Hard Questions: Synthetic Data Generation for Clinical QA
Viaarxiv icon

Gender Bias in Decision-Making with Large Language Models: A Study of Relationship Conflicts

Add code
Oct 14, 2024
Figure 1 for Gender Bias in Decision-Making with Large Language Models: A Study of Relationship Conflicts
Figure 2 for Gender Bias in Decision-Making with Large Language Models: A Study of Relationship Conflicts
Figure 3 for Gender Bias in Decision-Making with Large Language Models: A Study of Relationship Conflicts
Figure 4 for Gender Bias in Decision-Making with Large Language Models: A Study of Relationship Conflicts
Viaarxiv icon

Can Optimization Trajectories Explain Multi-Task Transfer?

Add code
Aug 26, 2024
Viaarxiv icon

Amuro & Char: Analyzing the Relationship between Pre-Training and Fine-Tuning of Large Language Models

Add code
Aug 14, 2024
Viaarxiv icon

A Closer Look at Claim Decomposition

Add code
Mar 18, 2024
Figure 1 for A Closer Look at Claim Decomposition
Figure 2 for A Closer Look at Claim Decomposition
Figure 3 for A Closer Look at Claim Decomposition
Figure 4 for A Closer Look at Claim Decomposition
Viaarxiv icon

Benchmarking Large Language Models on Answering and Explaining Challenging Medical Questions

Add code
Mar 13, 2024
Viaarxiv icon

Evaluating Biases in Context-Dependent Health Questions

Add code
Mar 07, 2024
Viaarxiv icon