Abstract: We propose yet another entity linking model (YELM), which links words to entities instead of spans. This sidesteps the difficulties associated with selecting good candidate mention spans and makes joint training of mention detection (MD) and entity disambiguation (ED) straightforward. Our model is based on BERT and produces contextualized word embeddings that are trained against a joint MD and ED objective. We achieve state-of-the-art results on several standard entity linking (EL) datasets.
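A minimal sketch of the word-level formulation described above, assuming a BERT encoder with a per-token classification head that scores each word against an entity vocabulary plus a "no mention" label, so MD and ED are trained with a single objective. The class name, label layout, and head design are illustrative assumptions, not the paper's exact architecture.

```python
import torch
import torch.nn as nn
from transformers import BertModel

class WordLevelEntityLinker(nn.Module):
    """Sketch: link each word to an entity (or to a 'no mention' label)."""

    def __init__(self, num_entities, model_name="bert-base-uncased"):
        super().__init__()
        self.encoder = BertModel.from_pretrained(model_name)
        # Index 0 is reserved for "not a mention" (the MD signal);
        # indices 1..num_entities identify entities (the ED signal).
        self.classifier = nn.Linear(self.encoder.config.hidden_size, num_entities + 1)

    def forward(self, input_ids, attention_mask, labels=None):
        hidden = self.encoder(input_ids=input_ids,
                              attention_mask=attention_mask).last_hidden_state
        logits = self.classifier(hidden)  # (batch, seq_len, num_entities + 1)
        if labels is None:
            return logits
        # One cross-entropy loss over per-token labels covers both MD and ED.
        loss = nn.functional.cross_entropy(
            logits.view(-1, logits.size(-1)), labels.view(-1))
        return loss, logits
```

Because every token gets a label, no separate span-proposal or candidate-pruning stage is needed at training time.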
Abstract: We propose a new attention mechanism for neural question answering that depends on varying granularities of the input. Previous work focused on augmenting recurrent neural networks with simple attention mechanisms that are a function of the similarity between a question embedding and answer embeddings across time. We extend this by making the attention mechanism dependent on a global embedding of the answer obtained using a separate network. We evaluate our system on InsuranceQA, a large question answering dataset, where our model outperforms the current state of the art. Further, we visualize which sections of text our attention mechanism focuses on and explore its performance across different parameter settings.
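A minimal sketch of attention conditioned on a global answer embedding, as described above: the score for each answer time step depends on the local answer state, the question embedding, and a global answer embedding produced by a separate encoder. The module name, scoring function, and tensor shapes are assumptions for illustration only.

```python
import torch
import torch.nn as nn

class GloballyConditionedAttention(nn.Module):
    """Sketch: attention over answer states, conditioned on the question
    embedding and a global answer embedding from a separate network."""

    def __init__(self, hidden_size):
        super().__init__()
        # Scores a concatenation of [local answer state, question, global answer].
        self.score = nn.Linear(3 * hidden_size, 1)

    def forward(self, answer_states, question_emb, global_answer_emb):
        # answer_states: (batch, time, hidden); the embeddings: (batch, hidden)
        t = answer_states.size(1)
        q = question_emb.unsqueeze(1).expand(-1, t, -1)
        g = global_answer_emb.unsqueeze(1).expand(-1, t, -1)
        scores = self.score(torch.cat([answer_states, q, g], dim=-1)).squeeze(-1)
        weights = torch.softmax(scores, dim=-1)  # (batch, time)
        # Attention-weighted summary of the answer, usable for QA matching.
        pooled = torch.bmm(weights.unsqueeze(1), answer_states).squeeze(1)
        return pooled, weights
```

The returned `weights` tensor is also what one would visualize to see which sections of the answer text the mechanism focuses on.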