Picture for Robert Morabito

Robert Morabito

STOP! Benchmarking Large Language Models with Sensitivity Testing on Offensive Progressions

Add code
Sep 20, 2024
Viaarxiv icon

Confidence Under the Hood: An Investigation into the Confidence-Probability Alignment in Large Language Models

Add code
May 29, 2024
Viaarxiv icon

Debiasing should be Good and Bad: Measuring the Consistency of Debiasing Techniques in Language Models

Add code
May 23, 2023
Viaarxiv icon