Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Shubham Mishra

LLMGuard: Guarding Against Unsafe LLM Behavior

Feb 27, 2024

Shubh Goyal, Medha Hira, Shubham Mishra, Sukriti Goyal, Arnav Goel, Niharika Dadu, Kirushikesh DB, Sameep Mehta, Nishtha Madaan

Figure 1 for LLMGuard: Guarding Against Unsafe LLM Behavior

Figure 2 for LLMGuard: Guarding Against Unsafe LLM Behavior

Abstract:Although the rise of Large Language Models (LLMs) in enterprise settings brings new opportunities and capabilities, it also brings challenges, such as the risk of generating inappropriate, biased, or misleading content that violates regulations and can have legal concerns. To alleviate this, we present "LLMGuard", a tool that monitors user interactions with an LLM application and flags content against specific behaviours or conversation topics. To do this robustly, LLMGuard employs an ensemble of detectors.

* accepted in demonstration track of AAAI-24

Via

Access Paper or Ask Questions

A Multimodal Corpus for Emotion Recognition in Sarcasm

Jun 05, 2022

Anupama Ray, Shubham Mishra, Apoorva Nunna, Pushpak Bhattacharyya

Figure 1 for A Multimodal Corpus for Emotion Recognition in Sarcasm

Figure 2 for A Multimodal Corpus for Emotion Recognition in Sarcasm

Figure 3 for A Multimodal Corpus for Emotion Recognition in Sarcasm

Figure 4 for A Multimodal Corpus for Emotion Recognition in Sarcasm

Abstract:While sentiment and emotion analysis have been studied extensively, the relationship between sarcasm and emotion has largely remained unexplored. A sarcastic expression may have a variety of underlying emotions. For example, "I love being ignored" belies sadness, while "my mobile is fabulous with a battery backup of only 15 minutes!" expresses frustration. Detecting the emotion behind a sarcastic expression is non-trivial yet an important task. We undertake the task of detecting the emotion in a sarcastic statement, which to the best of our knowledge, is hitherto unexplored. We start with the recently released multimodal sarcasm detection dataset (MUStARD) pre-annotated with 9 emotions. We identify and correct 343 incorrect emotion labels (out of 690). We double the size of the dataset, label it with emotions along with valence and arousal which are important indicators of emotional intensity. Finally, we label each sarcastic utterance with one of the four sarcasm types-Propositional, Embedded, Likeprefixed and Illocutionary, with the goal of advancing sarcasm detection research. Exhaustive experimentation with multimodal (text, audio, and video) fusion models establishes a benchmark for exact emotion recognition in sarcasm and outperforms the state-of-art sarcasm detection. We release the dataset enriched with various annotations and the code for research purposes: https://github.com/apoorva-nunna/MUStARD_Plus_Plus

Via

Access Paper or Ask Questions

Towards A Measure Of General Machine Intelligence

Sep 24, 2021

Gautham Venkatasubramanian, Sibesh Kar, Abhimanyu Singh, Shubham Mishra, Dushyant Yadav, Shreyansh Chandak

Figure 1 for Towards A Measure Of General Machine Intelligence

Figure 2 for Towards A Measure Of General Machine Intelligence

Figure 3 for Towards A Measure Of General Machine Intelligence

Figure 4 for Towards A Measure Of General Machine Intelligence

Abstract:To build increasingly general-purpose artificial intelligence systems that can deal with unknown variables across unknown domains, we need benchmarks that measure precisely how well these systems perform on tasks they have never seen before. A prerequisite for this is a measure of a task's generalization difficulty, or how dissimilar it is from the system's prior knowledge and experience. If the skill of an intelligence system in a particular domain is defined as it's ability to consistently generate a set of instructions (or programs) to solve tasks in that domain, current benchmarks do not quantitatively measure the efficiency of acquiring new skills, making it possible to brute-force skill acquisition by training with unlimited amounts of data and compute power. With this in mind, we first propose a common language of instruction, i.e. a programming language that allows the expression of programs in the form of directed acyclic graphs across a wide variety of real-world domains and computing platforms. Using programs generated in this language, we demonstrate a match-based method to both score performance and calculate the generalization difficulty of any given set of tasks. We use these to define a numeric benchmark called the g-index to measure and compare the skill-acquisition efficiency of any intelligence system on a set of real-world tasks. Finally, we evaluate the suitability of some well-known models as general intelligence systems by calculating their g-index scores.

* 29 pages, 12 Figures, 3 Tables Sample Dataset and Reference Code at https://github.com/mayahq/g-index-benchmark

Via

Access Paper or Ask Questions