Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Page Stream Segmentation with Convolutional Neural Nets Combining Textual and Visual Features

Feb 08, 2018

Gregor Wiedemann, Gerhard Heyer

Figure 1 for Page Stream Segmentation with Convolutional Neural Nets Combining Textual and Visual Features

Figure 2 for Page Stream Segmentation with Convolutional Neural Nets Combining Textual and Visual Features

Figure 3 for Page Stream Segmentation with Convolutional Neural Nets Combining Textual and Visual Features

Figure 4 for Page Stream Segmentation with Convolutional Neural Nets Combining Textual and Visual Features

Share this with someone who'll enjoy it:

Abstract:In recent years, (retro-)digitizing paper-based files became a major undertaking for private and public archives as well as an important task in electronic mailroom applications. As a first step, the workflow involves scanning and Optical Character Recognition (OCR) of documents. Preservation of document contexts of single page scans is a major requirement in this context. To facilitate workflows involving very large amounts of paper scans, page stream segmentation (PSS) is the task to automatically separate a stream of scanned images into multi-page documents. In a digitization project together with a German federal archive, we developed a novel approach based on convolutional neural networks (CNN) combining image and text features to achieve optimal document separation results. Evaluation shows that our PSS architecture achieves an accuracy up to 93 % which can be regarded as a new state-of-the-art for this task.

* Full paper version: 6 pages, 3 figures, 2 tables, accepted for LREC 2018

View paper on

Share this with someone who'll enjoy it:

Title:Page Stream Segmentation with Convolutional Neural Nets Combining Textual and Visual Features

Paper and Code