Picture for Tyler McDonald

Tyler McDonald

Can We Afford The Perfect Prompt? Balancing Cost and Accuracy with the Economical Prompting Index

Add code
Dec 02, 2024
Viaarxiv icon

NYT-Connections: A Deceptively Simple Text Classification Task that Stumps System-1 Thinkers

Add code
Dec 02, 2024
Viaarxiv icon

STOP! Benchmarking Large Language Models with Sensitivity Testing on Offensive Progressions

Add code
Sep 20, 2024
Viaarxiv icon