Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Neural Named Entity Recognition from Subword Units

Aug 27, 2018

Abdalghani Abujabal, Judith Gaspers

Figure 1 for Neural Named Entity Recognition from Subword Units

Figure 2 for Neural Named Entity Recognition from Subword Units

Figure 3 for Neural Named Entity Recognition from Subword Units

Figure 4 for Neural Named Entity Recognition from Subword Units

Share this with someone who'll enjoy it:

Abstract:Named entity recognition (NER) is a vital task in language technology. Existing neural models for NER rely mostly on dedicated word-level representations, which suffer from two main shortcomings: 1) the vocabulary size is large, yielding large memory requirements and training time, and 2) they cannot learn morphological representations. We adopt a neural solution based on bidirectional LSTMs and conditional random fields, where we rely on subword units, namely characters, phonemes, and bytes, to remedy the above shortcomings. We conducted experiments on a large dataset covering four languages with up to 5.5M utterances per language. Our experiments show that 1) with increasing training data, performance of models trained solely on subword units becomes closer to that of models with dedicated word-level embeddings (91.35 vs 93.92 F1 for English), while using a much smaller vocabulary size (332 vs 74K), 2) subword units enhance models with dedicated word-level embeddings, and 3) combining different subword units improves performance.

* 7 pages, 2 figures

View paper on

Share this with someone who'll enjoy it:

Title:Neural Named Entity Recognition from Subword Units

Paper and Code