Picture for Ali Emami

Ali Emami

Fine-Tuned LLMs are "Time Capsules" for Tracking Societal Bias Through Books

Add code
Feb 07, 2025
Viaarxiv icon

Think or Step-by-Step? UnZIPping the Black Box in Zero-Shot Prompts

Add code
Feb 05, 2025
Viaarxiv icon

Can We Afford The Perfect Prompt? Balancing Cost and Accuracy with the Economical Prompting Index

Add code
Dec 02, 2024
Figure 1 for Can We Afford The Perfect Prompt? Balancing Cost and Accuracy with the Economical Prompting Index
Figure 2 for Can We Afford The Perfect Prompt? Balancing Cost and Accuracy with the Economical Prompting Index
Figure 3 for Can We Afford The Perfect Prompt? Balancing Cost and Accuracy with the Economical Prompting Index
Figure 4 for Can We Afford The Perfect Prompt? Balancing Cost and Accuracy with the Economical Prompting Index
Viaarxiv icon

NYT-Connections: A Deceptively Simple Text Classification Task that Stumps System-1 Thinkers

Add code
Dec 02, 2024
Viaarxiv icon

MirrorStories: Reflecting Diversity through Personalized Narrative Generation with Large Language Models

Add code
Sep 24, 2024
Viaarxiv icon

STOP! Benchmarking Large Language Models with Sensitivity Testing on Offensive Progressions

Add code
Sep 20, 2024
Viaarxiv icon

Confidence Under the Hood: An Investigation into the Confidence-Probability Alignment in Large Language Models

Add code
May 29, 2024
Viaarxiv icon

Picturing Ambiguity: A Visual Twist on the Winograd Schema Challenge

Add code
May 28, 2024
Viaarxiv icon

Subtle Biases Need Subtler Measures: Dual Metrics for Evaluating Representative and Affinity Bias in Large Language Models

Add code
May 23, 2024
Figure 1 for Subtle Biases Need Subtler Measures: Dual Metrics for Evaluating Representative and Affinity Bias in Large Language Models
Figure 2 for Subtle Biases Need Subtler Measures: Dual Metrics for Evaluating Representative and Affinity Bias in Large Language Models
Figure 3 for Subtle Biases Need Subtler Measures: Dual Metrics for Evaluating Representative and Affinity Bias in Large Language Models
Figure 4 for Subtle Biases Need Subtler Measures: Dual Metrics for Evaluating Representative and Affinity Bias in Large Language Models
Viaarxiv icon

EvoGrad: A Dynamic Take on the Winograd Schema Challenge with Human Adversaries

Add code
Feb 22, 2024
Viaarxiv icon