Samson Tan

Learning to Generate Answers with Citations via Factual Consistency Models

Jun 19, 2024
Via arXiv

Lessons from the Trenches on Reproducible Evaluation of Language Models

May 23, 2024
Via arXiv

Extreme Miscalibration and the Illusion of Adversarial Robustness

Feb 27, 2024
Via arXiv

Automatic Feature Fairness in Recommendation via Adversaries

Sep 27, 2023
Via arXiv

Large Language Models of Code Fail at Completing Code with Potential Bugs

Jun 06, 2023
Via arXiv

ReCode: Robustness Evaluation of Code Generation Models

Dec 20, 2022
Via arXiv

BotSIM: An End-to-End Bot Simulation Framework for Commercial Task-Oriented Dialog Systems

Nov 30, 2022
Via arXiv

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Nov 09, 2022
Via arXiv

Whodunit? Learning to Contrast for Authorship Attribution

Oct 10, 2022
Via arXiv

The Risks of Machine Learning Systems

Apr 21, 2022
Via arXiv