Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Kinshuk Vasisht

Knowledge Graph Guided Evaluation of Abstention Techniques

Dec 10, 2024

Kinshuk Vasisht, Navreet Kaur, Danish Pruthi

Figure 1 for Knowledge Graph Guided Evaluation of Abstention Techniques

Figure 2 for Knowledge Graph Guided Evaluation of Abstention Techniques

Figure 3 for Knowledge Graph Guided Evaluation of Abstention Techniques

Figure 4 for Knowledge Graph Guided Evaluation of Abstention Techniques

Abstract:To deploy language models safely, it is crucial that they abstain from responding to inappropriate requests. Several prior studies test the safety promises of models based on their effectiveness in blocking malicious requests. In this work, we focus on evaluating the underlying techniques that cause models to abstain. We create SELECT, a benchmark derived from a set of benign concepts (e.g., "rivers") from a knowledge graph. The nature of SELECT enables us to isolate the effects of abstention techniques from other safety training procedures, as well as evaluate their generalization and specificity. Using SELECT, we benchmark different abstention techniques over six open-weight and closed-source models. We find that the examined techniques indeed cause models to abstain with over $80\%$ abstention rates. However, these techniques are not as effective for descendants of the target concepts, with refusal rates declining by $19\%$. We also characterize the generalization-vs-specificity trade-offs for different techniques. Overall, no single technique is invariably better than the others. Our findings call for a careful evaluation of different aspects of abstention, and hopefully inform practitioners of various trade-offs involved.

Via

Access Paper or Ask Questions

Richer Output for Richer Countries: Uncovering Geographical Disparities in Generated Stories and Travel Recommendations

Nov 11, 2024

Kirti Bhagat, Kinshuk Vasisht, Danish Pruthi

Figure 1 for Richer Output for Richer Countries: Uncovering Geographical Disparities in Generated Stories and Travel Recommendations

Figure 2 for Richer Output for Richer Countries: Uncovering Geographical Disparities in Generated Stories and Travel Recommendations

Figure 3 for Richer Output for Richer Countries: Uncovering Geographical Disparities in Generated Stories and Travel Recommendations

Figure 4 for Richer Output for Richer Countries: Uncovering Geographical Disparities in Generated Stories and Travel Recommendations

Abstract:While a large body of work inspects language models for biases concerning gender, race, occupation and religion, biases of geographical nature are relatively less explored. Some recent studies benchmark the degree to which large language models encode geospatial knowledge. However, the impact of the encoded geographical knowledge (or lack thereof) on real-world applications has not been documented. In this work, we examine large language models for two common scenarios that require geographical knowledge: (a) travel recommendations and (b) geo-anchored story generation. Specifically, we study four popular language models, and across about $100$K travel requests, and $200$K story generations, we observe that travel recommendations corresponding to poorer countries are less unique with fewer location references, and stories from these regions more often convey emotions of hardship and sadness compared to those from wealthier nations.

* Submitted to ARR - October 2024

Via

Access Paper or Ask Questions

Infusing Knowledge into Large Language Models with Contextual Prompts

Mar 03, 2024

Kinshuk Vasisht, Balaji Ganesan, Vikas Kumar, Vasudha Bhatnagar

Abstract:Knowledge infusion is a promising method for enhancing Large Language Models for domain-specific NLP tasks rather than pre-training models over large data from scratch. These augmented LLMs typically depend on additional pre-training or knowledge prompts from an existing knowledge graph, which is impractical in many applications. In contrast, knowledge infusion directly from relevant documents is more generalisable and alleviates the need for structured knowledge graphs while also being useful for entities that are usually not found in any knowledge graph. With this motivation, we propose a simple yet generalisable approach for knowledge infusion by generating prompts from the context in the input text. Our experiments show the effectiveness of our approach which we evaluate by probing the fine-tuned LLMs.

* 5 pages, 1 figure, In Proceedings of ICON 2023

Via

Access Paper or Ask Questions