Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Kangwei Ling

Effective Subword Segmentation for Text Comprehension

Nov 06, 2018

Zhuosheng Zhang, Hai Zhao, Kangwei Ling, Jiangtong Li, Zuchao Li, Shexia He

Figure 1 for Effective Subword Segmentation for Text Comprehension

Figure 2 for Effective Subword Segmentation for Text Comprehension

Figure 3 for Effective Subword Segmentation for Text Comprehension

Figure 4 for Effective Subword Segmentation for Text Comprehension

Abstract:Character-level representations have been broadly adopted to alleviate the problem of effectively representing rare or complex words. However, character itself is not a natural minimal linguistic unit for representation or word embedding composing due to ignoring the linguistic coherence of consecutive characters inside word. This paper presents a general subword-augmented embedding framework for learning and composing computationally-derived subword-level representations. We survey a series of unsupervised segmentation methods for subword acquisition and different subword-augmented strategies for text understanding, showing that subword-augmented embedding significantly improves our baselines in multiple text understanding tasks on both English and Chinese languages.

Via

Access Paper or Ask Questions