Abstract: Common interactions with language models currently take place through full inference. However, this approach may not align with the model's internal knowledge. Prior studies show discrepancies between prompts and internal representations, but most focus on sentence-level understanding. We study the mismatch between the internal and external understanding of word semantics across Encoder-only, Decoder-only, and Encoder-Decoder pre-trained language models.
Abstract: Role-play in Large Language Models (LLMs) is a crucial technique that enables models to adopt specific perspectives, enhancing their ability to generate contextually relevant and accurate responses. By simulating different roles, this approach improves reasoning capabilities across various NLP benchmarks, making the model's output better aligned with diverse scenarios. However, in this work, we demonstrate that role-play also carries potential risks. We systematically evaluate the impact of role-play by asking the language model to adopt different roles and testing it on multiple benchmarks that contain stereotypical and harmful questions. Despite significant fluctuations in benchmark results across experiments, we find that applying role-play often increases the overall likelihood of generating stereotypical and harmful outputs.
Abstract: With the growing deployment of large language models (LLMs) across various applications, assessing the influence of gender biases embedded in LLMs becomes crucial. Gender bias has received considerable attention in natural language processing (NLP), particularly in the context of English. Nonetheless, the investigation of gender bias in languages other than English remains relatively under-explored and insufficiently analyzed. In this work, we examine gender bias in LLM-generated outputs for different languages using three measurements: 1) gender bias in selecting descriptive words given a gender-related context; 2) gender bias in selecting gender-related pronouns (she/he) given descriptive words; and 3) gender bias in the topics of LLM-generated dialogues. We investigate the outputs of the GPT series of LLMs in various languages using these three measurement methods. Our findings reveal significant gender biases across all the languages we examined.