Chargrid: Towards Understanding 2D Documents

Sep 24, 2018

Anoop Raveendra Katti, Christian Reisswig, Cordula Guder, Sebastian Brarda, Steffen Bickel, Johannes Höhne, Jean Baptiste Faddoul

Abstract: We introduce a novel type of text representation that preserves the 2D layout of a document. This is achieved by encoding each document page as a two-dimensional grid of characters. Based on this representation, we present a generic document understanding pipeline for structured documents. This pipeline makes use of a fully convolutional encoder-decoder network that predicts a segmentation mask and bounding boxes. We demonstrate its capabilities on an information extraction task from invoices and show that it significantly outperforms approaches based on sequential text or document images.
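
The idea of encoding a page as a two-dimensional grid of characters can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `build_chargrid`, the `CharBox` tuple layout, the choice of 0 for background cells, and the on-the-fly character-to-index mapping are all assumptions made for illustration. It assumes each character arrives with a bounding box already expressed in grid-cell coordinates (for example from an OCR step that is not shown here).

```python
from typing import Dict, List, Tuple

# Hypothetical input format: (character, left, top, right, bottom) in
# grid-cell coordinates, with right and bottom exclusive.
CharBox = Tuple[str, int, int, int, int]


def build_chargrid(
    boxes: List[CharBox], width: int, height: int
) -> Tuple[List[List[int]], Dict[str, int]]:
    """Fill each character's box with an integer index; 0 marks background."""
    vocab: Dict[str, int] = {}  # assumed mapping: character -> index, starting at 1
    grid = [[0] * width for _ in range(height)]
    for ch, left, top, right, bottom in boxes:
        index = vocab.setdefault(ch, len(vocab) + 1)
        # Clip the box to the page so out-of-range boxes cannot raise.
        for y in range(max(top, 0), min(bottom, height)):
            for x in range(max(left, 0), min(right, width)):
                grid[y][x] = index
    return grid, vocab


# Toy page: "A" and "B" on the first text line, another "A" further down.
boxes = [("A", 0, 0, 2, 2), ("B", 3, 0, 5, 2), ("A", 0, 3, 2, 4)]
grid, vocab = build_chargrid(boxes, width=6, height=4)
for row in grid:
    print(row)
```

In this sketch, both occurrences of "A" share one index, so the grid keeps what the character is and where it sits on the page. A grid like this could then be turned into a multi-channel input (for instance by one-hot encoding the indices) for a convolutional network, though that step is outside what the sketch covers.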

* To be published at EMNLP 2018
