Amita Misra

Controlled Text Generation with Hidden Representation Transformations

May 31, 2023

Vaibhav Kumar, Hana Koorehdavoudi, Masud Moshtaghi, Amita Misra, Ankit Chadha, Emilio Ferrara

Abstract: We propose CHRT (Control Hidden Representation Transformation), a controlled language generation framework that steers large language models to generate text pertaining to certain attributes (such as toxicity). CHRT gains attribute control by modifying the hidden representation of the base model through learned transformations. We employ a contrastive-learning framework to learn these transformations, which can be combined to gain multi-attribute control. The effectiveness of CHRT is shown experimentally by comparing it with seven baselines over three attributes. CHRT outperforms all the baselines in the tasks of detoxification, positive sentiment steering, and text simplification while minimizing the loss in linguistic qualities. Further, our approach has the lowest inference latency, only 0.01 seconds more than the base model, making it the most suitable for high-performance production environments. We open-source our code and release two novel datasets to further propel controlled language generation research.

* Accepted at ACL 2023 as a long paper (Findings)

Machine Translation Impact in E-commerce Multilingual Search

Jan 31, 2023

Bryan Zhang, Amita Misra

Abstract: Previous work suggests that the performance of cross-lingual information retrieval correlates highly with the quality of Machine Translation. However, there may be a threshold beyond which improving query translation quality yields little or no further benefit to retrieval performance. This threshold may depend upon multiple factors, including the source and target languages, the existing MT system quality, and the search pipeline. In order to identify the benefit of improving an MT system for a given search pipeline, we investigate the sensitivity of retrieval quality to different levels of MT quality using experimental datasets collected from actual traffic. We systematically improve the quality of our MT systems on language pairs, as measured by MT evaluation metrics including BLEU and chrF, to determine their impact on search precision metrics and extract signals that help to guide the improvement strategies. Using this information, we develop techniques to compare query translations for multiple language pairs and identify the most promising language pairs to invest in and improve.

* Accepted by EMNLP 2022 (Industry Track)

Accountable Error Characterization

May 10, 2021

Amita Misra, Zhe Liu, Jalal Mahmud

Abstract: Customers of machine learning systems demand accountability from the companies employing these algorithms for various prediction tasks. Accountability requires an understanding of system limits and of the conditions under which erroneous predictions occur, as customers are often interested in understanding the incorrect predictions, and model developers are absorbed in finding methods that can be used to get incremental improvements to an existing system. Therefore, we propose an accountable error characterization method, AEC, to understand when and where errors occur within existing black-box models. AEC, constructed with human-understandable linguistic features, allows model developers to automatically identify the main sources of errors for a given classification system. It can also be used to sample the set of most informative input points for the next round of training. We perform error detection for a sentiment analysis task using AEC as a case study. Our results on the sample sentiment task show that AEC is able to characterize erroneous predictions into human-understandable categories and also achieves promising results on selecting erroneous samples when compared with uncertainty-based sampling.

* Proceedings of the First Workshop on Trustworthy Natural Language Processing, TrustNLP@NAACL-HLT 2021, June 10, 2021, Association for Computational Linguistics, 2021

Teacher-Student Learning Paradigm for Tri-training: An Efficient Method for Unlabeled Data Exploitation

Sep 25, 2019

Yash Bhalgat, Zhe Liu, Pritam Gundecha, Jalal Mahmud, Amita Misra

Abstract: Given that labeled data is expensive to obtain in real-world scenarios, many semi-supervised algorithms have explored the exploitation of unlabeled data. The traditional tri-training algorithm and tri-training with disagreement have shown promise in tasks where labeled data is limited. In this work, we introduce a new paradigm for tri-training, mimicking the real-world teacher-student learning process. We show that the adaptive teacher-student thresholds used in the proposed method provide more control over the learning process with higher label quality. We perform evaluation on the SemEval sentiment analysis task and provide comprehensive comparisons over experimental settings containing varied ratios of labeled to unlabeled data. Experimental results show that our method outperforms other strong semi-supervised baselines while requiring fewer labeled training samples.
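
The abstract does not spell out the selection rule, so the following is only a minimal sketch of one way a teacher-student pseudo-labeling step could look. It assumes that two "teacher" classifiers must agree on a label with confidence at or above a teacher threshold while the "student" classifier's confidence stays below a student threshold. The function names, the thresholds, and the rule itself are illustrative assumptions, not the authors' implementation; the adaptive part (how the thresholds change between rounds) is left to the caller.

```python
from typing import Dict, List, Tuple

Probs = Dict[str, float]  # maps each label to its predicted probability


def top_label(probs: Probs) -> Tuple[str, float]:
    """Return the most probable label and its probability."""
    label = max(probs, key=probs.get)
    return label, probs[label]


def select_pseudo_labels(
    teacher_a: List[Probs],
    teacher_b: List[Probs],
    student: List[Probs],
    teacher_threshold: float,
    student_threshold: float,
) -> List[Tuple[int, str]]:
    """Pick unlabeled examples to hand to the student for one round.

    Assumed rule (hypothetical): keep example i when both teachers predict
    the same label, both are at least `teacher_threshold` confident, and
    the student's own confidence is below `student_threshold`.
    """
    selected = []
    for i, (pa, pb, ps) in enumerate(zip(teacher_a, teacher_b, student)):
        label_a, conf_a = top_label(pa)
        label_b, conf_b = top_label(pb)
        _, conf_s = top_label(ps)
        teachers_agree = label_a == label_b
        teachers_confident = min(conf_a, conf_b) >= teacher_threshold
        student_unsure = conf_s < student_threshold
        if teachers_agree and teachers_confident and student_unsure:
            selected.append((i, label_a))
    return selected
```

In a full tri-training loop, each of the three classifiers would take the student role in turn, and the two thresholds would be updated between rounds.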

Using Structured Representation and Data: A Hybrid Model for Negation and Sentiment in Customer Service Conversations

Jun 11, 2019

Amita Misra, Mansurul Bhuiyan, Jalal Mahmud, Saurabh Tripathy

Abstract: Twitter customer service interactions have recently emerged as an effective platform to respond to and engage with customers. In this work, we explore the role of negation in customer service interactions, particularly as applied to sentiment analysis. We define rules to identify true negation cues and scope that are more suited to conversational data than to existing general review data. Using semantic knowledge and syntactic structure from constituency parse trees, we propose an algorithm for scope detection that performs comparably to a state-of-the-art BiLSTM. We further investigate the results of negation scope detection for the sentiment prediction task on customer service conversation data using both a traditional SVM and a neural network. We propose an antonym-dictionary-based method for negation applied to a CNN-LSTM combination model for sentiment analysis. Experimental results show that the antonym-based method outperforms the previous lexicon-based and neural network methods.

* Proceedings of the 10th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, 2019
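
To make the idea of antonym-based negation handling concrete, here is a minimal sketch under stated assumptions. The cue list, the tiny antonym dictionary, and the fixed look-ahead window are all hypothetical stand-ins: the paper derives negation scope from constituency parse trees, whereas this sketch simply searches the next few tokens after a cue.

```python
from typing import Dict, List, Set

# Hypothetical resources for illustration only.
NEGATION_CUES: Set[str] = {"not", "no", "never", "n't"}
ANTONYMS: Dict[str, str] = {
    "good": "bad",
    "happy": "unhappy",
    "helpful": "unhelpful",
}


def apply_antonym_negation(
    tokens: List[str],
    antonyms: Dict[str, str] = ANTONYMS,
    cues: Set[str] = NEGATION_CUES,
    window: int = 3,
) -> List[str]:
    """Rewrite 'cue ... word' as '... antonym(word)'.

    When a negation cue is followed, within `window` tokens, by a word that
    has a dictionary antonym, the cue is dropped and the word is replaced by
    its antonym. Cues with no such word in range are left untouched.
    """
    out: List[str] = []
    i = 0
    while i < len(tokens):
        if tokens[i].lower() in cues:
            end = min(i + 1 + window, len(tokens))
            for j in range(i + 1, end):
                if tokens[j].lower() in antonyms:
                    out.extend(tokens[i + 1:j])  # keep words between cue and target
                    out.append(antonyms[tokens[j].lower()])
                    i = j + 1
                    break
            else:
                out.append(tokens[i])  # no replaceable word in range
                i += 1
        else:
            out.append(tokens[i])
            i += 1
    return out
```

The rewritten token sequence could then be fed to a downstream sentiment classifier in place of the original text.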

Don't get Lost in Negation: An Effective Negation Handled Dialogue Acts Prediction Algorithm for Twitter Customer Service Conversations

Jul 16, 2018

Mansurul Bhuiyan, Amita Misra, Saurabh Tripathy, Jalal Mahmud, Rama Akkiraju

Abstract: In the last several years, Twitter has been adopted by companies as an alternative platform to interact with customers and address their concerns. With the abundance of such unconventional conversation resources, the push to develop effective virtual agents is stronger than ever. To address this challenge, a better understanding of such customer service conversations is required. Lately, there have been several works proposing a novel taxonomy for fine-grained dialogue acts as well as developing algorithms for automatic detection of these acts. The outcomes of these works provide stepping stones toward the ultimate goal of building efficient and effective virtual agents. But none of these works considers negation handling in the proposed algorithms. In this work, we develop an SVM-based dialogue act prediction algorithm for Twitter customer service conversations in which negation handling is an integral part of the end-to-end solution. For negation handling, we propose several efficient heuristics and also adopt recent state-of-the-art third-party machine-learning-based solutions. Empirically, we show the model's performance gain when negation is handled compared to when it is not. Our experiments show that for informal text such as tweets, the heuristic-based approach is more effective.

SlugNERDS: A Named Entity Recognition Tool for Open Domain Dialogue Systems

May 10, 2018

Kevin K. Bowden, Jiaqi Wu, Shereen Oraby, Amita Misra, Marilyn Walker

Abstract: In dialogue systems, the tasks of named entity recognition (NER) and named entity linking (NEL) are vital preprocessing steps for understanding user intent, especially in open domain interaction where we cannot rely on domain-specific inference. UCSC's effort as one of the funded teams in the 2017 Amazon Alexa Prize Contest has yielded Slugbot, an open domain social bot aimed at casual conversation. We discovered several challenges specifically associated with both NER and NEL when building Slugbot, for example that the NE labels are too coarse-grained or that the entity types are not linked to a useful ontology. Moreover, we have discovered that traditional approaches do not perform well in our context: even systems designed to operate on tweets or other social media data do not work well in dialogue systems. In this paper, we introduce Slugbot's Named Entity Recognition for dialogue Systems (SlugNERDS), a NER and NEL tool which is optimized to address these issues. We describe two new resources that we are building as part of this work: SlugEntityDB and SchemaActuator. We believe these resources will be useful for the research community.

* Kevin K. Bowden, Jiaqi Wu, Shereen Oraby, Amita Misra, and Marilyn Walker. SlugNERDS: A Named Entity Recognition Tool for Open Domain Dialogue Systems. Language Resources and Evaluation Conference (LREC), Miyazaki, Japan, 2018
* Resources can be found at: https://nlds.soe.ucsc.edu/node/56

Slugbot: An Application of a Novel and Scalable Open Domain Socialbot Framework

Jan 04, 2018

Kevin K. Bowden, Jiaqi Wu, Shereen Oraby, Amita Misra, Marilyn Walker

Abstract: In this paper, we introduce a novel, open domain socialbot for the Amazon Alexa Prize competition, aimed at carrying on friendly conversations with users on a variety of topics. We present our modular system, highlighting our different data sources and how we use the human mind as a model for data management. Additionally, we build and employ natural language understanding and information retrieval tools and APIs to expand our knowledge bases. We describe our semistructured, scalable framework for crafting topic-specific dialogue flows, and give details on our dialogue management schemes and scoring mechanisms. Finally, we briefly evaluate the performance of our system and observe the challenges that an open domain socialbot faces.

* Alexa Prize Proceedings 2017

Summarizing Dialogic Arguments from Social Media

Oct 31, 2017

Amita Misra, Shereen Oraby, Shubhangi Tandon, Sharath TS, Pranav Anand, Marilyn Walker

Abstract: Online argumentative dialog is a rich source of information on popular beliefs and opinions that could be useful to companies as well as governmental or public policy agencies. Compact, easy-to-read summaries of these dialogues would thus be highly valuable. A priori, it is not even clear what form such a summary should take. Previous work on summarization has primarily focused on summarizing written texts, where the notion of an abstract of the text is well defined. We collect gold standard training data consisting of five human summaries for each of 161 dialogues on the topics of Gay Marriage, Gun Control and Abortion. We present several different computational models aimed at identifying segments of the dialogues whose content should be used for the summary, using linguistic features and Word2vec features with both SVMs and Bidirectional LSTMs. We show that we can identify the most important arguments by using the dialog context, with a best F-measure of 0.74 for gun control, 0.71 for gay marriage, and 0.67 for abortion.

* Proceedings of the 21st Workshop on the Semantics and Pragmatics of Dialogue (SemDial 2017)

Combining Search with Structured Data to Create a More Engaging User Experience in Open Domain Dialogue

Sep 15, 2017

Kevin K. Bowden, Shereen Oraby, Jiaqi Wu, Amita Misra, Marilyn Walker

Abstract: The greatest challenges in building sophisticated open-domain conversational agents arise directly from the potential for ongoing mixed-initiative multi-turn dialogues, which do not follow a particular plan or pursue a particular fixed information need. In order to make coherent conversational contributions in this context, a conversational agent must be able to track the types and attributes of the entities under discussion in the conversation and know how they are related. In some cases, the agent can rely on structured information sources to help identify the relevant semantic relations and produce a turn, but in other cases, the only content available comes from search, and it may be unclear which semantic relations hold between the search results and the discourse context. A further constraint is that the system must produce its contribution to the ongoing conversation in real-time. This paper describes our experience building SlugBot for the 2017 Alexa Prize, and discusses how we leveraged search and structured data from different sources to help SlugBot produce dialogic turns and carry on conversations whose length over the semi-finals user evaluation period averaged 8:17 minutes.

* SCAI 2017
