David Pantoja

Improving Model Evaluation using SMART Filtering of Benchmark Datasets

Oct 26, 2024
Via arXiv

ESPERANTO: Evaluating Synthesized Phrases to Enhance Robustness in AI Detection for Text Origination

Sep 22, 2024
Via arXiv

Changing Answer Order Can Decrease MMLU Accuracy

Jun 27, 2024
Via arXiv