Adir Cohen

Text Segmentation as a Supervised Learning Task

Mar 25, 2018

Omri Koshorek, Adir Cohen, Noam Mor, Michael Rotman, Jonathan Berant

Abstract: Text segmentation, the task of dividing a document into contiguous segments based on its semantic structure, is a longstanding challenge in language understanding. Previous work on text segmentation focused on unsupervised methods such as clustering or graph search, due to the paucity in labeled data. In this work, we formulate text segmentation as a supervised learning problem, and present a large new dataset for text segmentation that is automatically extracted and labeled from Wikipedia. Moreover, we develop a segmentation model based on this dataset and show that it generalizes well to unseen natural text.

* 5 pages, 1 figure, NAACL 2018
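
One way to picture the supervised formulation in the abstract is as per-sentence binary labeling: a document that already has section divisions (as a Wikipedia article does) is flattened into a sentence sequence, and each sentence is labeled by whether it closes a segment. The sketch below is only an illustration under that assumption. The function name `sections_to_labels`, the label convention (1 = last sentence of a segment), and the toy document are hypothetical and are not taken from the paper or its released code.

```python
from typing import List, Tuple


def sections_to_labels(sections: List[List[str]]) -> Tuple[List[str], List[int]]:
    """Flatten a sectioned document into sentences and binary boundary labels.

    Assumed convention (not confirmed by the source): label 1 marks a sentence
    that ends a segment, label 0 marks every other sentence.
    """
    sentences: List[str] = []
    labels: List[int] = []
    for section in sections:
        for i, sentence in enumerate(section):
            sentences.append(sentence)
            # The last sentence of each section is a segment boundary.
            labels.append(1 if i == len(section) - 1 else 0)
    return sentences, labels


# Toy document: two sections, each a list of sentences (made-up example text).
doc = [
    ["Cats are small mammals.", "They are popular pets."],
    ["Dogs were domesticated early.", "Many breeds exist.", "They vary in size."],
]
sentences, labels = sections_to_labels(doc)
print(labels)
```

With labels in this form, any sequence classifier over sentences could in principle be trained on them. The final sentence of a document is always a boundary under this convention, so it carries no information and could be dropped from training.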
