Torsten Kilias

IDEL: In-Database Entity Linking with Neural Embeddings

Mar 13, 2018

Torsten Kilias, Alexander Löser, Felix A. Gers, Richard Koopmanschap, Ying Zhang, Martin Kersten

Abstract: We present a novel architecture, In-Database Entity Linking (IDEL), in which we integrate the analytics-optimized RDBMS MonetDB with neural text mining abilities. Our system design abstracts core tasks of most neural entity linking systems for MonetDB. To the best of our knowledge, this is the first de facto implemented system integrating entity linking in a database. We leverage the ability of MonetDB to support in-database analytics with user-defined functions (UDFs) implemented in Python. These functions call machine learning libraries for neural text mining, such as TensorFlow. The system achieves zero cost for data shipping and transformation by utilizing MonetDB's ability to embed Python processes in the database kernel and exchange data in NumPy arrays. IDEL represents text and relational data in a joint vector space with neural embeddings and can compensate for errors with ambiguous entity representations. For detecting matching entities, we propose a novel similarity function based on joint neural embeddings, which are learned by minimizing a pairwise contrastive ranking loss. This function utilizes high-dimensional index structures for fast retrieval of matching entities. Our first implementation and experiments using the WebNLG corpus show the effectiveness and potential of IDEL.

* This manuscript is a preprint of a paper submitted to VLDB 2018
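
The abstract mentions a similarity function learned by minimizing a pairwise contrastive ranking loss. The sketch below shows one common margin-based form of such a loss; it is not taken from the paper. The cosine similarity, the margin value of 0.2, and the function names `cosine`, `pairwise_ranking_loss`, and `link` are all assumptions made for illustration. The brute-force `link` function is only a stand-in for the high-dimensional index the abstract refers to.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def pairwise_ranking_loss(text_vec, pos_entity_vec, neg_entity_vecs, margin=0.2):
    """Hinge-style ranking loss (assumed form, not the paper's definition).

    The matching entity should score at least `margin` higher than
    every non-matching entity; each violation adds to the loss.
    """
    pos_score = cosine(text_vec, pos_entity_vec)
    loss = 0.0
    for neg_vec in neg_entity_vecs:
        loss += max(0.0, margin - pos_score + cosine(text_vec, neg_vec))
    return loss


def link(text_vec, entity_vecs):
    """Return the position of the most similar entity (brute-force scan)."""
    return max(range(len(entity_vecs)),
               key=lambda i: cosine(text_vec, entity_vecs[i]))


if __name__ == "__main__":
    mention = [1.0, 0.0]
    entities = [[0.0, 1.0], [1.0, 0.1]]
    print(pairwise_ranking_loss(mention, entities[1], [entities[0]]))
    print(link(mention, entities))
```

In a trained system the vectors would come from the learned embedding model rather than being written by hand, and the linear scan would be replaced by an index lookup.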

Robust Named Entity Recognition in Idiosyncratic Domains

Aug 24, 2016

Sebastian Arnold, Felix A. Gers, Torsten Kilias, Alexander Löser

Abstract: Named entity recognition often fails in idiosyncratic domains. This causes problems for dependent tasks, such as entity linking and relation extraction. We propose a generic and robust approach for high-recall named entity recognition. Our approach is easy to train and offers strong generalization over diverse domain-specific language, such as news documents (e.g. Reuters) or biomedical text (e.g. Medline). Our approach is based on deep contextual sequence learning and utilizes stacked bidirectional LSTM networks. Our model is trained with only a few hundred labeled sentences and does not rely on further external knowledge. We report F1 scores in the range of 84-94% on standard datasets.

* 8 pages, 1 figure
