Picture for Tyler McDonald

Tyler McDonald

Trace-of-Thought Prompting: Investigating Prompt-Based Knowledge Distillation Through Question Decomposition

Add code
Apr 30, 2025
Figure 1 for Trace-of-Thought Prompting: Investigating Prompt-Based Knowledge Distillation Through Question Decomposition
Figure 2 for Trace-of-Thought Prompting: Investigating Prompt-Based Knowledge Distillation Through Question Decomposition
Figure 3 for Trace-of-Thought Prompting: Investigating Prompt-Based Knowledge Distillation Through Question Decomposition
Figure 4 for Trace-of-Thought Prompting: Investigating Prompt-Based Knowledge Distillation Through Question Decomposition
Viaarxiv icon

Can We Afford The Perfect Prompt? Balancing Cost and Accuracy with the Economical Prompting Index

Add code
Dec 02, 2024
Figure 1 for Can We Afford The Perfect Prompt? Balancing Cost and Accuracy with the Economical Prompting Index
Figure 2 for Can We Afford The Perfect Prompt? Balancing Cost and Accuracy with the Economical Prompting Index
Figure 3 for Can We Afford The Perfect Prompt? Balancing Cost and Accuracy with the Economical Prompting Index
Figure 4 for Can We Afford The Perfect Prompt? Balancing Cost and Accuracy with the Economical Prompting Index
Viaarxiv icon

NYT-Connections: A Deceptively Simple Text Classification Task that Stumps System-1 Thinkers

Add code
Dec 02, 2024
Figure 1 for NYT-Connections: A Deceptively Simple Text Classification Task that Stumps System-1 Thinkers
Figure 2 for NYT-Connections: A Deceptively Simple Text Classification Task that Stumps System-1 Thinkers
Figure 3 for NYT-Connections: A Deceptively Simple Text Classification Task that Stumps System-1 Thinkers
Figure 4 for NYT-Connections: A Deceptively Simple Text Classification Task that Stumps System-1 Thinkers
Viaarxiv icon

STOP! Benchmarking Large Language Models with Sensitivity Testing on Offensive Progressions

Add code
Sep 20, 2024
Figure 1 for STOP! Benchmarking Large Language Models with Sensitivity Testing on Offensive Progressions
Figure 2 for STOP! Benchmarking Large Language Models with Sensitivity Testing on Offensive Progressions
Figure 3 for STOP! Benchmarking Large Language Models with Sensitivity Testing on Offensive Progressions
Figure 4 for STOP! Benchmarking Large Language Models with Sensitivity Testing on Offensive Progressions
Viaarxiv icon