Chilin Shih

AT&T Bell Laboratories

A Stochastic Finite-State Word-Segmentation Algorithm for Chinese

May 05, 1994

Richard Sproat, Chilin Shih, William Gale, Nancy Chang

Abstract: We present a stochastic finite-state model for segmenting Chinese text into dictionary entries and productively derived words, and providing pronunciations for these words; the method incorporates a class-based model in its treatment of personal names. We also evaluate the system's performance, taking into account the fact that people often do not agree on a single segmentation.
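
To make the idea of stochastic segmentation concrete, here is a minimal Python sketch, not the authors' implementation. It assumes a simplification in which each dictionary word carries a cost equal to its negative log unigram probability and the cheapest sequence of words covering the input is chosen by dynamic programming, which stands in for a best-path search through a weighted finite-state machine. The lexicon `TOY_COUNTS`, its counts, the helper names `word_costs` and `segment`, and the `unknown_cost` fallback are all hypothetical; productively derived words, personal names, and pronunciations are not modeled.

```python
import math

# Hypothetical toy lexicon: word -> made-up frequency count (not from the paper).
TOY_COUNTS = {"日": 5, "文": 4, "章": 2, "魚": 3, "日文": 8, "文章": 9, "章魚": 7}


def word_costs(counts):
    """Turn counts into costs: cost(w) = -log(count(w) / total count)."""
    total = sum(counts.values())
    return {w: -math.log(c / total) for w, c in counts.items()}


def segment(text, costs, unknown_cost=20.0):
    """Return (words, total_cost) for the cheapest segmentation of text."""
    max_len = max(len(w) for w in costs)
    # best[i] = (cost of the cheapest segmentation of text[:i],
    #            start index of the last word in that segmentation)
    best = [(0.0, 0)] + [(math.inf, 0)] * len(text)
    for end in range(1, len(text) + 1):
        for start in range(max(0, end - max_len), end):
            piece = text[start:end]
            if piece in costs:
                step = costs[piece]
            elif len(piece) == 1:
                # Assumed fallback: an unlisted single character gets a flat
                # penalty so that every input has at least one path.
                step = unknown_cost
            else:
                continue
            candidate = best[start][0] + step
            if candidate < best[end][0]:
                best[end] = (candidate, start)
    # Follow the back-pointers from the end of the text to recover the words.
    words, pos = [], len(text)
    while pos > 0:
        start = best[pos][1]
        words.append(text[start:pos])
        pos = start
    return words[::-1], best[len(text)][0]
```

With these toy counts, `segment("日文章魚", word_costs(TOY_COUNTS))` selects 日文 + 章魚 over 日 + 文章 + 魚, because two frequent two-character words cost less in total than the three-word alternative.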

* in Proceedings of ACL 94
