Ayush Pareek

Measuring Bias in Contextualized Word Representations

Jun 18, 2019

Keita Kurita, Nidhi Vyas, Ayush Pareek, Alan W Black, Yulia Tsvetkov

Abstract: Contextual word embeddings such as BERT have achieved state-of-the-art performance in numerous NLP tasks. Since they are optimized to capture the statistical properties of training data, they tend to pick up on and amplify social stereotypes present in the data as well. In this study, we (1) propose a template-based method to quantify bias in BERT; (2) show that this method obtains more consistent results in capturing social biases than the traditional cosine-based method; and (3) conduct a case study, evaluating gender bias in a downstream task of Gender Pronoun Resolution. Although our case study focuses on gender bias, the proposed technique is generalizable to unveiling other biases, including in multiclass settings, such as racial and religious biases.

* 1st ACL Workshop on Gender Bias for Natural Language Processing 2019
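
A template-based bias measurement of the kind the abstract describes can be sketched roughly as follows. The abstract does not give the scoring formula, so the log-ratio score below is an assumption: it compares the probability a masked language model assigns to a target word when the attribute is present in the template against the probability when the attribute is also masked. The function names, the template, and all probabilities are hypothetical; no model is queried here.

```python
import math

def association_score(p_target, p_prior):
    """Log ratio of the target's probability with the attribute visible
    versus with the attribute masked (an assumed scoring rule)."""
    if p_target <= 0 or p_prior <= 0:
        raise ValueError("probabilities must be positive")
    return math.log(p_target / p_prior)

def template_bias(probs):
    """Map each target word to its association score.

    probs: dict of target word -> (p_target, p_prior).
    """
    return {word: association_score(pt, pp) for word, (pt, pp) in probs.items()}

# Hypothetical probabilities for the template "[TARGET] is a programmer".
# A real run would read these from a masked language model's output.
example = {"he": (0.30, 0.20), "she": (0.10, 0.20)}
scores = template_bias(example)

# A positive gap means the attribute raises "he" more than "she".
gap = scores["he"] - scores["she"]
print(scores, gap)
```

Dividing by the prior is meant to separate the attribute's effect from how frequent the target word is on its own; other normalizations are possible.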

Graph-based Neural Multi-Document Summarization

Aug 23, 2017

Michihiro Yasunaga, Rui Zhang, Kshitijh Meelu, Ayush Pareek, Krishnan Srinivasan, Dragomir Radev

Abstract: We propose a neural multi-document summarization (MDS) system that incorporates sentence relation graphs. We employ a Graph Convolutional Network (GCN) on the relation graphs, with sentence embeddings obtained from Recurrent Neural Networks as input node features. Through multiple layer-wise propagation, the GCN generates high-level hidden sentence features for salience estimation. We then use a greedy heuristic to extract salient sentences while avoiding redundancy. In our experiments on DUC 2004, we consider three types of sentence relation graphs and demonstrate the advantage of combining sentence relations in graphs with the representation power of deep neural networks. Our model improves upon traditional graph-based extractive approaches and the vanilla GRU sequence model with no graph, and it achieves competitive results against other state-of-the-art multi-document summarization systems.

* In CoNLL 2017
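
The greedy extraction step mentioned in the abstract might look something like the sketch below. The abstract does not specify the redundancy measure or any threshold, so the Jaccard token overlap, the `max_sim` cutoff, and the summary length are all assumptions. The salience scores are hypothetical stand-ins for what a salience estimator would output.

```python
def jaccard(a, b):
    """Token-set overlap between two sentences (assumed redundancy measure)."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)

def greedy_select(sentences, salience, max_sentences=2, max_sim=0.5):
    """Pick sentences in descending salience order, skipping any sentence
    too similar to one that has already been chosen."""
    order = sorted(range(len(sentences)), key=lambda i: salience[i], reverse=True)
    chosen = []
    for i in order:
        if len(chosen) >= max_sentences:
            break
        if all(jaccard(sentences[i], sentences[j]) < max_sim for j in chosen):
            chosen.append(i)
    return [sentences[i] for i in chosen]

sentences = [
    "the storm hit the coast on monday",
    "the storm hit the coast monday night",
    "rescue teams reached the town on tuesday",
]
salience = [0.9, 0.8, 0.6]  # hypothetical scores, one per sentence

summary = greedy_select(sentences, salience)
print(summary)
```

Here the second sentence is skipped despite its high score because it overlaps heavily with the first, so the third sentence fills the remaining slot.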
