Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jonathan H. Morgan

When do Word Embeddings Accurately Reflect Surveys on our Beliefs About People?

Apr 25, 2020

Kenneth Joseph, Jonathan H. Morgan

Figure 1 for When do Word Embeddings Accurately Reflect Surveys on our Beliefs About People?

Figure 2 for When do Word Embeddings Accurately Reflect Surveys on our Beliefs About People?

Figure 3 for When do Word Embeddings Accurately Reflect Surveys on our Beliefs About People?

Figure 4 for When do Word Embeddings Accurately Reflect Surveys on our Beliefs About People?

Abstract:Social biases are encoded in word embeddings. This presents a unique opportunity to study society historically and at scale, and a unique danger when embeddings are used in downstream applications. Here, we investigate the extent to which publicly-available word embeddings accurately reflect beliefs about certain kinds of people as measured via traditional survey methods. We find that biases found in word embeddings do, on average, closely mirror survey data across seventeen dimensions of social meaning. However, we also find that biases in embeddings are much more reflective of survey data for some dimensions of meaning (e.g. gender) than others (e.g. race), and that we can be highly confident that embedding-based measures reflect survey data only for the most salient biases.

* Accepted at ACL2020

Via

Access Paper or Ask Questions