George Townsend

Effective Neural Solution for Multi-Criteria Word Segmentation

Jan 04, 2018

Han He, Lei Wu, Hua Yan, Zhimin Gao, Yi Feng, George Townsend

Abstract: We present a simple yet elegant solution for training a single joint model on multi-criteria corpora for Chinese Word Segmentation (CWS). Our design requires no private layers in the model architecture; instead, it introduces two artificial tokens at the beginning and end of the input sentence to specify the required target criterion. The rest of the model, including the Long Short-Term Memory (LSTM) layer and the Conditional Random Field (CRF) layer, remains unchanged and is shared across all datasets, keeping the parameter count minimal and constant. On the Bakeoff 2005 and Bakeoff 2008 datasets, our design surpasses both single-criterion and multi-criteria state-of-the-art results. To the best of our knowledge, our design is the first to achieve state-of-the-art performance on such large-scale datasets. The source code and corpora for this paper are available on GitHub.
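
The token-wrapping idea described in this abstract can be illustrated with a minimal Python sketch. The token spelling (`<pku>`, `</pku>`), the criterion names, and the helper functions `wrap_with_criterion` and `strip_criterion_tags` are assumptions made for illustration and are not taken from the paper's released code. The sketch only shows the input preparation; the shared LSTM-CRF model itself is not reproduced here.

```python
from typing import List


def wrap_with_criterion(chars: List[str], criterion: str) -> List[str]:
    """Surround a character sequence with two artificial tokens
    that name the target segmentation criterion (hypothetical spelling)."""
    open_token = f"<{criterion}>"
    close_token = f"</{criterion}>"
    return [open_token] + chars + [close_token]


def strip_criterion_tags(labels: List[str]) -> List[str]:
    """Drop the labels predicted for the two artificial tokens,
    keeping one label per original character."""
    return labels[1:-1]


# The same sentence prepared for two different criteria.
# The tagger architecture stays identical; only the two tokens change.
sentence = list("他来自北京大学")
pku_input = wrap_with_criterion(sentence, "pku")
msr_input = wrap_with_criterion(sentence, "msr")
```

Under this scheme the artificial tokens are ordinary vocabulary entries, so they would receive embeddings like any other character and no criterion-specific layer would be needed.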

* 2nd International Conference on Smart Computing & Informatics (SCI-2018), Springer Smart Innovation Systems and Technologies Book Series, Springer-Verlag, Accepted & Forthcoming, 2018

Dual Long Short-Term Memory Networks for Sub-Character Representation Learning

Jan 04, 2018

Han He, Lei Wu, Xiaokun Yang, Hua Yan, Zhimin Gao, Yi Feng, George Townsend

Abstract: Characters have commonly been regarded as the minimal processing unit in Natural Language Processing (NLP). However, many non-Latin languages have hieroglyphic writing systems with large alphabets of thousands or millions of characters. Each character is composed of even smaller parts, which previous work has often ignored. In this paper, we propose a novel architecture employing two stacked Long Short-Term Memory networks (LSTMs) to learn sub-character level representations and capture deeper levels of semantic meaning. To build a concrete study and substantiate the efficiency of our neural architecture, we take Chinese Word Segmentation as a case study. Chinese is a typical example of such languages: every character contains several components called radicals. Our networks employ a shared radical-level embedding to solve both Simplified and Traditional Chinese Word Segmentation without an extra Traditional-to-Simplified conversion step; this end-to-end approach significantly simplifies word segmentation compared to previous work. Radical-level embeddings also capture deeper semantic meaning below the character level and improve learning performance. By tying radical and character embeddings together, the parameter count is reduced while semantic knowledge is shared and transferred between the two levels, substantially boosting performance. On 3 out of 4 Bakeoff 2005 datasets, our method surpasses state-of-the-art results by up to 0.4%. Our results are reproducible; the source code and corpora are available on GitHub.
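
The shared radical-level vocabulary mentioned in this abstract can be sketched in a few lines of Python. The decomposition table `RADICALS` below is a toy example written for illustration, and the helpers `build_radical_vocab` and `char_to_radical_ids` are hypothetical names; none of this is taken from the paper's code or resources. The sketch covers only the lookup step and does not implement the stacked LSTMs.

```python
from typing import Dict, List

# Toy decomposition table (illustrative only, not the paper's resource).
RADICALS: Dict[str, List[str]] = {
    "妈": ["女", "马"],  # Simplified form
    "媽": ["女", "馬"],  # Traditional form
    "好": ["女", "子"],
}


def build_radical_vocab(table: Dict[str, List[str]]) -> Dict[str, int]:
    """Assign one integer id per distinct radical, shared by all characters."""
    vocab: Dict[str, int] = {}
    for radicals in table.values():
        for radical in radicals:
            vocab.setdefault(radical, len(vocab))
    return vocab


def char_to_radical_ids(
    char: str, table: Dict[str, List[str]], vocab: Dict[str, int]
) -> List[int]:
    """Map a character to the id sequence of its radicals."""
    return [vocab[radical] for radical in table[char]]


vocab = build_radical_vocab(RADICALS)
simplified_ids = char_to_radical_ids("妈", RADICALS, vocab)
traditional_ids = char_to_radical_ids("媽", RADICALS, vocab)
```

In this toy setup the Simplified and Traditional characters share the id of the radical 女, so an embedding row learned for that radical would be reused across both scripts without any conversion step.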

* Accepted & forthcoming at ITNG-2018
