Picture for Supriti Vijay

Supriti Vijay

When Neutral Summaries are not that Neutral: Quantifying Political Neutrality in LLM-Generated News Summaries

Add code
Oct 13, 2024
Viaarxiv icon

FRACTURED-SORRY-Bench: Framework for Revealing Attacks in Conversational Turns Undermining Refusal Efficacy and Defenses over SORRY-Bench

Add code
Aug 28, 2024
Viaarxiv icon

Counterfactual Explanation Policies in RL

Add code
Jul 25, 2023
Viaarxiv icon

Are Chatbots Ready for Privacy-Sensitive Applications? An Investigation into Input Regurgitation and Prompt-Induced Sanitization

Add code
May 24, 2023
Viaarxiv icon

#maskUp: Selective Attribute Encryption for Sensitive Vocalization for English language on Social Media Platforms

Add code
Nov 16, 2022
Viaarxiv icon

AdaptKeyBERT: An Attention-Based approach towards Few-Shot & Zero-Shot Domain Adaptation of KeyBERT

Add code
Nov 16, 2022
Viaarxiv icon

NERDA-Con: Extending NER models for Continual Learning -- Integrating Distinct Tasks and Updating Distribution Shifts

Add code
Jun 28, 2022
Figure 1 for NERDA-Con: Extending NER models for Continual Learning -- Integrating Distinct Tasks and Updating Distribution Shifts
Figure 2 for NERDA-Con: Extending NER models for Continual Learning -- Integrating Distinct Tasks and Updating Distribution Shifts
Figure 3 for NERDA-Con: Extending NER models for Continual Learning -- Integrating Distinct Tasks and Updating Distribution Shifts
Figure 4 for NERDA-Con: Extending NER models for Continual Learning -- Integrating Distinct Tasks and Updating Distribution Shifts
Viaarxiv icon

ExCode-Mixed: Explainable Approaches towards Sentiment Analysis on Code-Mixed Data using BERT models

Add code
Sep 25, 2021
Figure 1 for ExCode-Mixed: Explainable Approaches towards Sentiment Analysis on Code-Mixed Data using BERT models
Viaarxiv icon