Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Vinayshekhar Bannihatti Kumar

Privacy Adhering Machine Un-learning in NLP

Dec 19, 2022

Vinayshekhar Bannihatti Kumar, Rashmi Gangadharaiah, Dan Roth

Figure 1 for Privacy Adhering Machine Un-learning in NLP

Figure 2 for Privacy Adhering Machine Un-learning in NLP

Figure 3 for Privacy Adhering Machine Un-learning in NLP

Figure 4 for Privacy Adhering Machine Un-learning in NLP

Abstract:Regulations introduced by General Data Protection Regulation (GDPR) in the EU or California Consumer Privacy Act (CCPA) in the US have included provisions on the \textit{right to be forgotten} that mandates industry applications to remove data related to an individual from their systems. In several real world industry applications that use Machine Learning to build models on user data, such mandates require significant effort both in terms of data cleansing as well as model retraining while ensuring the models do not deteriorate in prediction quality due to removal of data. As a result, continuous removal of data and model retraining steps do not scale if these applications receive such requests at a very high frequency. Recently, a few researchers proposed the idea of \textit{Machine Unlearning} to tackle this challenge. Despite the significant importance of this task, the area of Machine Unlearning is under-explored in Natural Language Processing (NLP) tasks. In this paper, we explore the Unlearning framework on various GLUE tasks \cite{Wang:18}, such as, QQP, SST and MNLI. We propose computationally efficient approaches (SISA-FC and SISA-A) to perform \textit{guaranteed} Unlearning that provides significant reduction in terms of both memory (90-95\%), time (100x) and space consumption (99\%) in comparison to the baselines while keeping model performance constant.

Via

Access Paper or Ask Questions

Unsupervised Neural Stylistic Text Generation using Transfer learning and Adapters

Oct 07, 2022

Vinayshekhar Bannihatti Kumar, Rashmi Gangadharaiah, Dan Roth

Figure 1 for Unsupervised Neural Stylistic Text Generation using Transfer learning and Adapters

Figure 2 for Unsupervised Neural Stylistic Text Generation using Transfer learning and Adapters

Figure 3 for Unsupervised Neural Stylistic Text Generation using Transfer learning and Adapters

Figure 4 for Unsupervised Neural Stylistic Text Generation using Transfer learning and Adapters

Abstract:Research has shown that personality is a key driver to improve engagement and user experience in conversational systems. Conversational agents should also maintain a consistent persona to have an engaging conversation with a user. However, text generation datasets are often crowd sourced and thereby have an averaging effect where the style of the generation model is an average style of all the crowd workers that have contributed to the dataset. While one can collect persona-specific datasets for each task, it would be an expensive and time consuming annotation effort. In this work, we propose a novel transfer learning framework which updates only $0.3\%$ of model parameters to learn style specific attributes for response generation. For the purpose of this study, we tackle the problem of stylistic story ending generation using the ROC stories Corpus. We learn style specific attributes from the PERSONALITY-CAPTIONS dataset. Through extensive experiments and evaluation metrics we show that our novel training procedure can improve the style generation by 200 over Encoder-Decoder baselines while maintaining on-par content relevance metrics with

Via

Access Paper or Ask Questions

Dr.Quad at MEDIQA 2019: Towards Textual Inference and Question Entailment using contextualized representations

Jul 23, 2019

Vinayshekhar Bannihatti Kumar, Ashwin Srinivasan, Aditi Chaudhary, James Route, Teruko Mitamura, Eric Nyberg

Figure 1 for Dr.Quad at MEDIQA 2019: Towards Textual Inference and Question Entailment using contextualized representations

Figure 2 for Dr.Quad at MEDIQA 2019: Towards Textual Inference and Question Entailment using contextualized representations

Figure 3 for Dr.Quad at MEDIQA 2019: Towards Textual Inference and Question Entailment using contextualized representations

Figure 4 for Dr.Quad at MEDIQA 2019: Towards Textual Inference and Question Entailment using contextualized representations

Abstract:This paper presents the submissions by Team Dr.Quad to the ACL-BioNLP 2019 shared task on Textual Inference and Question Entailment in the Medical Domain. Our system is based on the prior work Liu et al. (2019) which uses a multi-task objective function for textual entailment. In this work, we explore different strategies for generalizing state-of-the-art language understanding models to the specialized medical domain. Our results on the shared task demonstrate that incorporating domain knowledge through data augmentation is a powerful strategy for addressing challenges posed by specialized domains such as medicine.

* Accepted in ACL challenge MediQA as part of the BioNLP workshop

Via

Access Paper or Ask Questions

WriterForcing: Generating more interesting story endings

Jul 18, 2019

Prakhar Gupta, Vinayshekhar Bannihatti Kumar, Mukul Bhutani, Alan W Black

Figure 1 for WriterForcing: Generating more interesting story endings

Figure 2 for WriterForcing: Generating more interesting story endings

Figure 3 for WriterForcing: Generating more interesting story endings

Figure 4 for WriterForcing: Generating more interesting story endings

Abstract:We study the problem of generating interesting endings for stories. Neural generative models have shown promising results for various text generation problems. Sequence to Sequence (Seq2Seq) models are typically trained to generate a single output sequence for a given input sequence. However, in the context of a story, multiple endings are possible. Seq2Seq models tend to ignore the context and generate generic and dull responses. Very few works have studied generating diverse and interesting story endings for a given story context. In this paper, we propose models which generate more diverse and interesting outputs by 1) training models to focus attention on important keyphrases of the story, and 2) promoting generation of non-generic words. We show that the combination of the two leads to more diverse and interesting endings.

* Accepted in ACL workshop on Storytelling 2019

Via

Access Paper or Ask Questions