Luyao Huang

GlossBERT: BERT for Word Sense Disambiguation with Gloss Knowledge

Aug 20, 2019

Luyao Huang, Chi Sun, Xipeng Qiu, Xuanjing Huang

Abstract: Word Sense Disambiguation (WSD) aims to find the exact sense of an ambiguous word in a particular context. Traditional supervised methods rarely take into account lexical resources like WordNet, which are widely used in knowledge-based methods. Recent studies have shown the effectiveness of incorporating glosses (sense definitions) into neural networks for WSD. However, compared with traditional word-expert supervised methods, they have not achieved much improvement. In this paper, we focus on how to better leverage gloss knowledge in a supervised neural WSD system. We construct context-gloss pairs and propose three BERT-based models for WSD. We fine-tune the pre-trained BERT model and achieve new state-of-the-art results on the WSD task.

* EMNLP-IJCNLP 2019
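
The context-gloss pairs mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact input format, so everything here is an assumption for illustration: the toy sense inventory `TOY_GLOSSES`, the sense keys, the helper name `build_context_gloss_pairs`, and the binary labelling (1 for the gold sense, 0 otherwise). A real system would presumably read glosses from WordNet and feed each pair to BERT as a sentence-pair input.

```python
from typing import Dict, List, Tuple

# Hypothetical toy sense inventory (not from the paper); real glosses would
# come from a lexical resource such as WordNet.
TOY_GLOSSES: Dict[str, Dict[str, str]] = {
    "bank": {
        "bank.n.01": "sloping land beside a body of water",
        "bank.n.02": "a financial institution that accepts deposits",
    }
}


def build_context_gloss_pairs(
    context: str,
    target: str,
    gold_sense: str,
    glosses: Dict[str, Dict[str, str]],
) -> List[Tuple[str, str, int]]:
    """Pair the context sentence with every candidate gloss of the target word.

    Returns (context, gloss_text, label) triples, where label is 1 only for
    the gloss of the gold sense. This turns WSD into binary classification
    over sentence pairs (an assumed formulation).
    """
    pairs = []
    for sense_key, gloss in glosses[target].items():
        label = 1 if sense_key == gold_sense else 0
        pairs.append((context, f"{target}: {gloss}", label))
    return pairs


pairs = build_context_gloss_pairs(
    "She sat on the bank of the river.", "bank", "bank.n.01", TOY_GLOSSES
)
for context, gloss, label in pairs:
    print(label, "|", context, "|", gloss)
```

Each candidate sense yields one pair, so an ambiguous word with many senses produces many training examples from a single annotated sentence.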

DropAttention: A Regularization Method for Fully-Connected Self-Attention Networks

Jul 26, 2019

Lin Zehui, Pengfei Liu, Luyao Huang, Junkun Chen, Xipeng Qiu, Xuanjing Huang

Abstract: Various dropout methods have been designed for the fully-connected, convolutional, and recurrent layers in neural networks, and have been shown to be effective at avoiding overfitting. As an appealing alternative to recurrent and convolutional layers, the fully-connected self-attention layer surprisingly lacks a specific dropout method. This paper explores the possibility of regularizing the attention weights in Transformers to prevent different contextualized feature vectors from co-adaptation. Experiments on a wide range of tasks show that DropAttention can improve performance and reduce overfitting.
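
To make the idea of regularizing attention weights concrete, here is a minimal pure-Python sketch. The abstract does not specify the dropping scheme, so this is only one plausible variant and not necessarily the authors' method: individual attention weights in a row are zeroed with probability `p`, and the surviving weights are renormalized so the row still sums to 1. The function names, the renormalization step, and the fallback when every weight is dropped are all assumptions.

```python
import math
import random
from typing import List


def softmax(scores: List[float]) -> List[float]:
    """Numerically stable softmax over one row of attention scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def drop_attention_row(
    weights: List[float], p: float, rng: random.Random
) -> List[float]:
    """Zero each attention weight with probability p, then renormalize.

    Assumed behaviour: if every weight happens to be dropped, the original
    row is returned unchanged so the output stays a valid distribution.
    """
    kept = [w if rng.random() >= p else 0.0 for w in weights]
    total = sum(kept)
    if total == 0.0:
        return list(weights)
    return [w / total for w in kept]


rng = random.Random(0)  # fixed seed so the sketch is reproducible
row = softmax([1.0, 2.0, 3.0, 4.0])
print(drop_attention_row(row, p=0.5, rng=rng))
```

As with ordinary dropout, such a mask would be applied only during training; at inference time the attention weights would be used as computed.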

Utilizing BERT for Aspect-Based Sentiment Analysis via Constructing Auxiliary Sentence

Mar 22, 2019

Chi Sun, Luyao Huang, Xipeng Qiu

Abstract: Aspect-based sentiment analysis (ABSA), which aims to identify fine-grained opinion polarity towards a specific aspect, is a challenging subtask of sentiment analysis (SA). In this paper, we construct an auxiliary sentence from the aspect and convert ABSA to a sentence-pair classification task, such as question answering (QA) and natural language inference (NLI). We fine-tune the pre-trained BERT model and achieve new state-of-the-art results on the SentiHood and SemEval-2014 Task 4 datasets.

* Accepted to NAACL 2019
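
The conversion of ABSA into a sentence-pair task can be sketched as follows. The abstract only says that an auxiliary sentence is constructed from the aspect, so the templates below are illustrative assumptions: a question for the QA-style variant and the bare aspect phrase as a pseudo-sentence for the NLI-style variant. The function names and the example review are hypothetical as well.

```python
from typing import Tuple


def build_auxiliary_sentence(aspect: str, style: str = "qa") -> str:
    """Turn an aspect into an auxiliary sentence (templates are assumed)."""
    if style == "qa":
        # QA-style: phrase the aspect as a question.
        return f"What do you think of the {aspect}?"
    if style == "nli":
        # NLI-style: use the aspect itself as a short pseudo-sentence.
        return aspect
    raise ValueError(f"unknown style: {style!r}")


def to_sentence_pair(review: str, aspect: str, style: str = "qa") -> Tuple[str, str]:
    """Pair the review with its auxiliary sentence for pair classification."""
    return (review, build_auxiliary_sentence(aspect, style))


review = "The food was great but the service was slow."
for aspect in ("food", "service"):
    print(to_sentence_pair(review, aspect, style="qa"))
```

One review with several aspects thus becomes several sentence pairs, each of which a pair classifier can label with the polarity towards that aspect.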
