Picture for Megan Ung

Megan Ung

Improving Model Evaluation using SMART Filtering of Benchmark Datasets

Add code
Oct 26, 2024
Viaarxiv icon

Changing Answer Order Can Decrease MMLU Accuracy

Add code
Jun 27, 2024
Figure 1 for Changing Answer Order Can Decrease MMLU Accuracy
Figure 2 for Changing Answer Order Can Decrease MMLU Accuracy
Figure 3 for Changing Answer Order Can Decrease MMLU Accuracy
Figure 4 for Changing Answer Order Can Decrease MMLU Accuracy
Viaarxiv icon

ROBBIE: Robust Bias Evaluation of Large Generative Language Models

Add code
Nov 29, 2023
Figure 1 for ROBBIE: Robust Bias Evaluation of Large Generative Language Models
Figure 2 for ROBBIE: Robust Bias Evaluation of Large Generative Language Models
Figure 3 for ROBBIE: Robust Bias Evaluation of Large Generative Language Models
Figure 4 for ROBBIE: Robust Bias Evaluation of Large Generative Language Models
Viaarxiv icon

Training Models to Generate, Recognize, and Reframe Unhelpful Thoughts

Add code
Jul 06, 2023
Viaarxiv icon

Improving Open Language Models by Learning from Organic Interactions

Add code
Jun 07, 2023
Viaarxiv icon

Learning New Skills after Deployment: Improving open-domain internet-driven dialogue with human feedback

Add code
Aug 16, 2022
Figure 1 for Learning New Skills after Deployment: Improving open-domain internet-driven dialogue with human feedback
Figure 2 for Learning New Skills after Deployment: Improving open-domain internet-driven dialogue with human feedback
Figure 3 for Learning New Skills after Deployment: Improving open-domain internet-driven dialogue with human feedback
Figure 4 for Learning New Skills after Deployment: Improving open-domain internet-driven dialogue with human feedback
Viaarxiv icon

BlenderBot 3: a deployed conversational agent that continually learns to responsibly engage

Add code
Aug 10, 2022
Figure 1 for BlenderBot 3: a deployed conversational agent that continually learns to responsibly engage
Figure 2 for BlenderBot 3: a deployed conversational agent that continually learns to responsibly engage
Figure 3 for BlenderBot 3: a deployed conversational agent that continually learns to responsibly engage
Figure 4 for BlenderBot 3: a deployed conversational agent that continually learns to responsibly engage
Viaarxiv icon

SaFeRDialogues: Taking Feedback Gracefully after Conversational Safety Failures

Add code
Oct 14, 2021
Figure 1 for SaFeRDialogues: Taking Feedback Gracefully after Conversational Safety Failures
Figure 2 for SaFeRDialogues: Taking Feedback Gracefully after Conversational Safety Failures
Figure 3 for SaFeRDialogues: Taking Feedback Gracefully after Conversational Safety Failures
Figure 4 for SaFeRDialogues: Taking Feedback Gracefully after Conversational Safety Failures
Viaarxiv icon