Yuanmeng Yan

Disentangling Confidence Score Distribution for Out-of-Domain Intent Detection with Energy-Based Learning

Oct 17, 2022

Yanan Wu, Zhiyuan Zeng, Keqing He, Yutao Mou, Pei Wang, Yuanmeng Yan, Weiran Xu

Abstract: Detecting Out-of-Domain (OOD) or unknown intents from user queries is essential in a task-oriented dialog system. Traditional softmax-based confidence scores are susceptible to the overconfidence issue. In this paper, we propose a simple but strong energy-based score function to detect OOD samples, where the energy scores of OOD samples are higher than those of IND samples. Further, given a small set of labeled OOD samples, we introduce an energy-based margin objective for supervised OOD detection to explicitly distinguish OOD samples from INDs. Comprehensive experiments and analysis show that our method helps disentangle the confidence score distributions of IND and OOD data. Our code is available at https://github.com/pris-nlp/EMNLP2022-energy_for_OOD/.

* Accepted by the EMNLP2022 SereTOD workshop
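
As a rough illustration of the energy score described in the abstract above, the sketch below computes the free energy of a logit vector and a hinge-style margin loss over IND and OOD energies. The function names, the temperature parameter, the margins `m_ind` and `m_ood`, and the squared-hinge form are assumptions taken from common energy-based OOD practice. The abstract does not spell out the exact objective, so this should not be read as the authors' implementation.

```python
import math


def energy_score(logits, temperature=1.0):
    """Free energy of a logit vector: -T * log(sum(exp(logit / T)))."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    log_sum_exp = peak + math.log(sum(math.exp(s - peak) for s in scaled))
    return -temperature * log_sum_exp


def margin_loss(ind_energies, ood_energies, m_ind, m_ood):
    """Assumed squared-hinge margin loss.

    Penalizes IND energies above m_ind and OOD energies below m_ood,
    which pushes the two energy distributions apart.
    """
    ind_term = sum(max(0.0, e - m_ind) ** 2 for e in ind_energies) / len(ind_energies)
    ood_term = sum(max(0.0, m_ood - e) ** 2 for e in ood_energies) / len(ood_energies)
    return ind_term + ood_term


def is_ood(logits, threshold, temperature=1.0):
    """Flag a sample as OOD when its energy exceeds a chosen threshold."""
    return energy_score(logits, temperature) > threshold
```

A peaked logit vector such as `[10.0, 0.0, 0.0]` gets a lower energy than a flat one such as `[0.0, 0.0, 0.0]`, so a threshold between the two would flag only the flat prediction as OOD.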

Rethink about the Word-level Quality Estimation for Machine Translation from Human Judgement

Sep 13, 2022

Zhen Yang, Fandong Meng, Yuanmeng Yan, Jie Zhou

Abstract: Word-level Quality Estimation (QE) of Machine Translation (MT) aims to find potential translation errors in a translated sentence without a reference. Conventional work on word-level QE is typically designed to predict translation quality in terms of post-editing effort, where the word labels ("OK" and "BAD") are generated automatically by comparing words between MT sentences and the post-edited sentences through a Translation Error Rate (TER) toolkit. While post-editing effort can measure translation quality to some extent, we find it usually conflicts with human judgement on whether a word is well or poorly translated. To overcome this limitation, we first create a golden benchmark dataset, namely HJQE (Human Judgement on Quality Estimation), where expert translators directly annotate the poorly translated words based on their own judgement. Additionally, to make further use of the parallel corpus, we propose self-supervised pre-training with two tag correcting strategies, namely a tag refinement strategy and a tree-based annotation strategy, to make the TER-based artificial QE corpus closer to HJQE. We conduct substantial experiments based on the publicly available WMT En-De and En-Zh corpora. The results not only show that our proposed dataset is more consistent with human judgement but also confirm the effectiveness of the proposed tag correcting strategies. The data can be found at https://github.com/ZhenYangIACAS/HJQE.

* 8 pages, 6 figures
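
To make the conventional labeling scheme mentioned in the abstract above more concrete, the sketch below tags each MT word as "OK" or "BAD" by aligning the MT sentence against its post-edited version. It uses Python's `difflib.SequenceMatcher` as a stand-in aligner, not a real TER toolkit (TER additionally models block shifts), and the function name `tag_mt_words` is hypothetical.

```python
from difflib import SequenceMatcher


def tag_mt_words(mt_tokens, pe_tokens):
    """Tag each MT token "OK" if it aligns to the post-edit, else "BAD".

    Assumption: a longest-matching-block alignment approximates the
    edit alignment that a TER toolkit would produce.
    """
    tags = ["BAD"] * len(mt_tokens)
    matcher = SequenceMatcher(a=mt_tokens, b=pe_tokens, autojunk=False)
    for block in matcher.get_matching_blocks():
        # Every MT token inside a matching block was left untouched
        # by the post-editor, so it is labeled "OK".
        for i in range(block.a, block.a + block.size):
            tags[i] = "OK"
    return tags


if __name__ == "__main__":
    mt = "he go home".split()
    pe = "he goes home".split()
    print(list(zip(mt, tags := tag_mt_words(mt, pe))))
```

Labels produced this way reflect what the post-editor changed, which is the gap the abstract points to: an edited word is not necessarily one a human would judge as poorly translated.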

InstructionNER: A Multi-Task Instruction-Based Generative Framework for Few-shot NER

Mar 08, 2022

Liwen Wang, Rumei Li, Yang Yan, Yuanmeng Yan, Sirui Wang, Wei Wu, Weiran Xu

Abstract: Recently, prompt-based methods have achieved significant performance in few-shot learning scenarios by bridging the gap between language model pre-training and fine-tuning for downstream tasks. However, existing prompt templates are mostly designed for sentence-level tasks and are inappropriate for sequence labeling objectives. To address this issue, we propose a multi-task instruction-based generative framework, named InstructionNER, for low-resource named entity recognition. Specifically, we reformulate the NER task as a generation problem, which enriches source sentences with task-specific instructions and answer options, then infers the entities and types in natural language. We further propose two auxiliary tasks, entity extraction and entity typing, which enable the model to capture more boundary information of entities and deepen the understanding of entity type semantics, respectively. Experimental results show that our method consistently outperforms other baselines on five datasets in few-shot settings.

* Work in progress
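
The reformulation of NER as generation described in the abstract above can be pictured as building a text input from the sentence, an instruction, and the answer options, then parsing a natural-language answer. The template wording, the "X is a Y" answer format, and the function names below are purely hypothetical, since the abstract does not give the actual templates.

```python
def build_ner_prompt(sentence, entity_types):
    """Combine a sentence, an instruction, and answer options into one input.

    The exact wording here is an assumed template for illustration only.
    """
    options = ", ".join(entity_types)
    return (
        f"Sentence: {sentence}\n"
        "Instruction: please extract entities and their types from the sentence.\n"
        f"Options: {options}"
    )


def parse_answer(answer):
    """Parse an assumed 'X is a Y, Z is a W' answer into (entity, type) pairs."""
    pairs = []
    for chunk in answer.split(","):
        chunk = chunk.strip()
        if " is a " in chunk:
            entity, entity_type = chunk.rsplit(" is a ", 1)
            pairs.append((entity.strip(), entity_type.strip()))
    return pairs


if __name__ == "__main__":
    print(build_ner_prompt("Marie Curie moved to Paris.", ["person", "location"]))
    print(parse_answer("Marie Curie is a person, Paris is a location"))
```

This simple parser splits on commas, so it would mishandle entities that themselves contain commas. A real system would need a more careful output format.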

Bridge to Target Domain by Prototypical Contrastive Learning and Label Confusion: Re-explore Zero-Shot Learning for Slot Filling

Oct 07, 2021

Liwen Wang, Xuefeng Li, Jiachi Liu, Keqing He, Yuanmeng Yan, Weiran Xu

Abstract: Zero-shot cross-domain slot filling alleviates the data dependence in the case of data scarcity in the target domain, which has attracted extensive research. However, as most existing methods do not achieve effective knowledge transfer to the target domain, they merely fit the distribution of the seen slots and perform poorly on unseen slots in the target domain. To solve this, we propose a novel approach based on prototypical contrastive learning with a dynamic label confusion strategy for zero-shot slot filling. The prototypical contrastive learning aims to reconstruct the semantic constraints of labels, and we introduce the label confusion strategy to establish the label dependence between the source domains and the target domain on the fly. Experimental results show that our model achieves significant improvement on the unseen slots, while also setting a new state of the art on the slot filling task.

* Accepted by EMNLP 2021

Novel Slot Detection: A Benchmark for Discovering Unknown Slot Types in the Task-Oriented Dialogue System

May 29, 2021

Yanan Wu, Zhiyuan Zeng, Keqing He, Hong Xu, Yuanmeng Yan, Huixing Jiang, Weiran Xu

Abstract: Existing slot filling models can only recognize pre-defined in-domain slot types from a limited slot set. In the practical application, a reliable dialogue system should know what it does not know. In this paper, we introduce a new task, Novel Slot Detection (NSD), in the task-oriented dialogue system. NSD aims to discover unknown or out-of-domain slot types to strengthen the capability of a dialogue system based on in-domain training data. Besides, we construct two public NSD datasets, propose several strong NSD baselines, and establish a benchmark for future work. Finally, we conduct exhaustive experiments and qualitative analysis to comprehend key challenges and provide new guidance for future directions.

* Accepted by ACL2021

Modeling Discriminative Representations for Out-of-Domain Detection with Supervised Contrastive Learning

May 29, 2021

Zhiyuan Zeng, Keqing He, Yuanmeng Yan, Zijun Liu, Yanan Wu, Hong Xu, Huixing Jiang, Weiran Xu

Abstract: Detecting Out-of-Domain (OOD) or unknown intents from user queries is essential in a task-oriented dialog system. A key challenge of OOD detection is to learn discriminative semantic features. Traditional cross-entropy loss only focuses on whether a sample is correctly classified, and does not explicitly distinguish the margins between categories. In this paper, we propose a supervised contrastive learning objective to minimize intra-class variance by pulling together in-domain intents belonging to the same class and maximize inter-class variance by pushing apart samples from different classes. Besides, we employ an adversarial augmentation mechanism to obtain pseudo diverse views of a sample in the latent space. Experiments on two public datasets demonstrate the effectiveness of our method in capturing discriminative representations for OOD detection.

* Accepted by ACL2021
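
The pull-together, push-apart objective described in the abstract above can be sketched with a standard supervised contrastive loss over a batch of feature vectors. The cosine similarity, the temperature value, and the function names are assumptions; the abstract does not state the exact formulation, and the adversarial augmentation step is not shown.

```python
import math


def _cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def supervised_contrastive_loss(features, labels, temperature=0.1):
    """Average, over anchors, of the negative log-probability of same-class samples.

    For each anchor, all other samples compete in a softmax over scaled
    similarities; samples sharing the anchor's label are the positives.
    """
    n = len(features)
    total, anchors = 0.0, 0
    for i in range(n):
        positives = [j for j in range(n) if j != i and labels[j] == labels[i]]
        if not positives:
            continue  # an anchor with no same-class partner contributes nothing
        logits = {j: _cosine(features[i], features[j]) / temperature
                  for j in range(n) if j != i}
        peak = max(logits.values())  # stabilizes the log-sum-exp
        log_denominator = peak + math.log(
            sum(math.exp(v - peak) for v in logits.values()))
        total += -sum(logits[p] - log_denominator for p in positives) / len(positives)
        anchors += 1
    return total / anchors if anchors else 0.0
```

When features of the same class already cluster together the loss is close to zero, and it grows when same-class samples sit nearer to other classes than to each other.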

ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer

May 25, 2021

Yuanmeng Yan, Rumei Li, Sirui Wang, Fuzheng Zhang, Wei Wu, Weiran Xu

Abstract: Learning high-quality sentence representations benefits a wide range of natural language processing tasks. Though BERT-based pre-trained language models achieve high performance on many downstream tasks, the natively derived sentence representations have been shown to collapse and thus perform poorly on semantic textual similarity (STS) tasks. In this paper, we present ConSERT, a Contrastive Framework for Self-Supervised Sentence Representation Transfer, that adopts contrastive learning to fine-tune BERT in an unsupervised and effective way. By making use of unlabeled texts, ConSERT solves the collapse issue of BERT-derived sentence representations and makes them more applicable for downstream tasks. Experiments on STS datasets demonstrate that ConSERT achieves an 8% relative improvement over the previous state-of-the-art, even comparable to the supervised SBERT-NLI. When further incorporating NLI supervision, we achieve new state-of-the-art performance on STS tasks. Moreover, ConSERT obtains comparable results with only 1000 samples available, showing its robustness in data scarcity scenarios.

* Accepted by ACL2021, 10 pages, 7 figures, 4 tables
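
For readers unfamiliar with unsupervised contrastive fine-tuning of the kind the abstract above describes, the sketch below shows one common choice of objective, the NT-Xent loss over two augmented views of each sentence. Treating `view_a[i]` and `view_b[i]` as embeddings of two augmentations of sentence `i`, the temperature value, and the function names are all assumptions. The abstract does not name the loss or the augmentations, and no BERT encoder is involved here.

```python
import math


def _cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def nt_xent_loss(view_a, view_b, temperature=0.1):
    """NT-Xent loss: each embedding must pick out its other view.

    view_a[i] and view_b[i] are assumed to embed two augmentations of
    the same sentence; every other embedding in the batch is a negative.
    """
    reps = list(view_a) + list(view_b)
    n = len(view_a)
    total = 0.0
    for i in range(2 * n):
        positive = (i + n) % (2 * n)  # index of the other view of the same sentence
        logits = {j: _cosine(reps[i], reps[j]) / temperature
                  for j in range(2 * n) if j != i}
        peak = max(logits.values())  # stabilizes the log-sum-exp
        log_denominator = peak + math.log(
            sum(math.exp(v - peak) for v in logits.values()))
        total += log_denominator - logits[positive]
    return total / (2 * n)
```

The loss is small when the two views of each sentence are closer to each other than to any other sentence in the batch, which spreads representations apart and works against the collapse the abstract mentions.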
