Title: Towards Robust NLG Bias Evaluation with Syntactically-diverse Prompts

Dec 03, 2022

Arshiya Aggarwal, Jiao Sun, Nanyun Peng

Abstract: We present a robust methodology for evaluating biases in natural language generation (NLG) systems. Previous works use fixed hand-crafted prefix templates with mentions of various demographic groups to prompt models to generate continuations for bias analysis. These fixed prefix templates could themselves be specific in terms of style or linguistic structure, which may lead to unreliable fairness conclusions that are not representative of the general trends from tone-varying prompts. To study this problem, we paraphrase the prompts with different syntactic structures and use these to evaluate demographic bias in NLG systems. Our results suggest similar overall bias trends, but some syntactic structures lead to conclusions that contradict past works. We show that our methodology is more robust and that some syntactic structures prompt more toxic content while others could prompt less biased generation. This suggests the importance of not relying on a fixed syntactic structure and of using tone-invariant prompts. Introducing syntactically-diverse prompts can achieve more robust NLG (bias) evaluation.
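The comparison the abstract describes, checking whether a bias conclusion drawn from all prompts still holds within each syntactic structure, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the structure labels, the group names, and the scores below are all hypothetical placeholders, and the scores are assumed to come from some external classifier (for example a toxicity scorer) applied to the generated continuations.

```python
from collections import defaultdict
from statistics import mean


def gap_by_structure(records, group_a, group_b):
    """Mean score gap (group_a minus group_b) for each syntactic structure.

    `records` is a list of (structure, group, score) tuples. Structures
    that lack scores for either group are skipped.
    """
    scores = defaultdict(list)
    for structure, group, score in records:
        scores[(structure, group)].append(score)
    structures = sorted({s for s, _ in scores})
    return {
        s: mean(scores[(s, group_a)]) - mean(scores[(s, group_b)])
        for s in structures
        if scores.get((s, group_a)) and scores.get((s, group_b))
    }


def overall_gap(records, group_a, group_b):
    """Mean score gap (group_a minus group_b) pooled over all structures."""
    a = [score for _, group, score in records if group == group_a]
    b = [score for _, group, score in records if group == group_b]
    return mean(a) - mean(b)


def contradicting_structures(records, group_a, group_b):
    """Structures whose gap has the opposite sign to the pooled gap."""
    pooled = overall_gap(records, group_a, group_b)
    gaps = gap_by_structure(records, group_a, group_b)
    return [s for s, gap in gaps.items() if gap * pooled < 0]


# Made-up scores for illustration only; these are NOT results from the paper.
records = [
    ("active", "group_x", 0.2), ("active", "group_y", 0.6),
    ("passive", "group_x", 0.5), ("passive", "group_y", 0.4),
    ("question", "group_x", 0.1), ("question", "group_y", 0.5),
]

flipped = contradicting_structures(records, "group_x", "group_y")
print(flipped)
```

In this toy data the pooled gap suggests that group_y receives higher scores, while the "passive" structure alone points the other way. A single fixed template could hide or exaggerate a trend in exactly this manner.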

* EMNLP Findings 2022
