Abstract: In this work, we study the problem of named entity recognition (NER) in low-resource scenarios, focusing on few-shot and zero-shot settings. Building upon large-scale pre-trained language models, we propose a novel NER framework, SpanNER, which learns from natural language supervision and enables the identification of never-seen entity classes without using in-domain labeled data. We perform extensive experiments on five benchmark datasets and evaluate the proposed method in the few-shot learning, domain transfer, and zero-shot learning settings. The experimental results show that the proposed method brings average improvements of 10%, 23%, and 26% over the best baselines in the few-shot learning, domain transfer, and zero-shot learning settings, respectively.
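The abstract's core mechanism is matching candidate spans against natural-language descriptions of entity classes, so unseen classes can be recognized from their descriptions alone. Below is a minimal sketch of that idea; it is not the paper's implementation. The `encode` function is a hypothetical stand-in for a pre-trained encoder (a real system would use contextual representations from a model such as BERT), and the class descriptions and threshold are illustrative.

```python
# Minimal sketch of span-to-class matching for zero-shot NER.
# Assumption (not from the paper): `encode` is a hypothetical stand-in for a
# pre-trained span/description encoder.
import numpy as np

def encode(text: str, dim: int = 8) -> np.ndarray:
    """Hypothetical encoder: maps text to a unit-norm vector.
    Seeded per string so repeated calls on the same text agree."""
    local = np.random.default_rng(abs(hash(text)) % (2**32))
    v = local.standard_normal(dim)
    return v / np.linalg.norm(v)

def classify_span(span_text: str, class_descriptions: dict[str, str],
                  threshold: float = 0.0) -> str | None:
    """Score a candidate span against natural-language class descriptions
    and return the best-matching class, or None if below threshold."""
    span_vec = encode(span_text)
    scores = {label: float(encode(desc) @ span_vec)
              for label, desc in class_descriptions.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] >= threshold else None

# Never-seen classes are supervised only by their descriptions,
# with no in-domain labeled data.
descriptions = {
    "PERSON": "a person's name",
    "DISEASE": "a disease or medical condition",
}
print(classify_span("John Smith", descriptions, threshold=-1.0))
```

Because class semantics enter only through the description embeddings, adding a new entity type amounts to writing one sentence rather than annotating a new dataset.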
Abstract: Neural sequence labeling is an important technique employed for many Natural Language Processing (NLP) tasks, such as Named Entity Recognition (NER), slot tagging for dialog systems, and semantic parsing. Large-scale pre-trained language models obtain very good performance on these tasks when fine-tuned on large amounts of task-specific labeled data. However, such large-scale labeled datasets are difficult to obtain for many tasks and domains due to the high cost of human annotation as well as privacy and data-access constraints for sensitive user applications. This problem is exacerbated for sequence labeling tasks, which require such annotations at the token level. In this work, we develop techniques to address the label-scarcity challenge for neural sequence labeling models. Specifically, we develop MetaST, which combines self-training and meta-learning for few-shot training of neural sequence taggers. While self-training serves as an effective mechanism to learn from large amounts of unlabeled data, meta-learning helps with adaptive sample re-weighting to mitigate error propagation from noisy pseudo-labels. Extensive experiments on six benchmark datasets, including two massive multilingual NER datasets and four slot tagging datasets for task-oriented dialog systems, demonstrate the effectiveness of our method, with around 10% improvement over state-of-the-art systems in the 10-shot setting.
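To make the self-training-plus-re-weighting interaction concrete, here is a minimal sketch of a teacher-student loop in which pseudo-labeled examples are weighted before contributing to the student's loss. Everything below is illustrative, not the paper's method: the linear taggers, toy token features, and the confidence-based weighting rule are assumptions, and MetaST in particular learns the sample weights via meta-learning rather than the hand-coded heuristic used here.

```python
# Minimal sketch of self-training with adaptive sample re-weighting.
# Assumptions (not from the paper): toy numeric token features, linear
# taggers, and a confidence-based weighting heuristic standing in for
# meta-learned weights.
import torch
import torch.nn.functional as F

torch.manual_seed(0)
NUM_TAGS, DIM = 3, 5

teacher = torch.nn.Linear(DIM, NUM_TAGS)  # imagined as trained on the few labeled shots
student = torch.nn.Linear(DIM, NUM_TAGS)
opt = torch.optim.SGD(student.parameters(), lr=0.1)

unlabeled = torch.randn(64, DIM)  # stand-in for encoder features of unlabeled tokens

for step in range(5):
    with torch.no_grad():
        probs = F.softmax(teacher(unlabeled), dim=-1)
        conf, pseudo = probs.max(dim=-1)  # pseudo-labels and their confidences
    # Re-weight each pseudo-labeled token: down-weighting low-confidence
    # predictions curbs error propagation from noisy pseudo-labels.
    weights = conf / conf.sum()
    loss = (weights * F.cross_entropy(student(unlabeled), pseudo,
                                      reduction="none")).sum()
    opt.zero_grad()
    loss.backward()
    opt.step()
    print(f"step {step}: weighted pseudo-label loss = {loss.item():.4f}")
```

The key design point the abstract argues for is visible in the `weights` line: without per-sample weights, every noisy pseudo-label pulls on the student equally, whereas an adaptive weighting (here a heuristic, in MetaST a meta-learned one) lets reliable pseudo-labels dominate training.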