Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Sebastian Brarda

Chargrid: Towards Understanding 2D Documents

Sep 24, 2018

Anoop Raveendra Katti, Christian Reisswig, Cordula Guder, Sebastian Brarda, Steffen Bickel, Johannes Höhne, Jean Baptiste Faddoul

Figure 1 for Chargrid: Towards Understanding 2D Documents

Figure 2 for Chargrid: Towards Understanding 2D Documents

Figure 3 for Chargrid: Towards Understanding 2D Documents

Figure 4 for Chargrid: Towards Understanding 2D Documents

Abstract:We introduce a novel type of text representation that preserves the 2D layout of a document. This is achieved by encoding each document page as a two-dimensional grid of characters. Based on this representation, we present a generic document understanding pipeline for structured documents. This pipeline makes use of a fully convolutional encoder-decoder network that predicts a segmentation mask and bounding boxes. We demonstrate its capabilities on an information extraction task from invoices and show that it significantly outperforms approaches based on sequential text or document images.

* To be published at EMNLP 2018

Via

Access Paper or Ask Questions

Sequential Attention: A Context-Aware Alignment Function for Machine Reading

Jun 26, 2017

Sebastian Brarda, Philip Yeres, Samuel R. Bowman

Figure 1 for Sequential Attention: A Context-Aware Alignment Function for Machine Reading

Figure 2 for Sequential Attention: A Context-Aware Alignment Function for Machine Reading

Figure 3 for Sequential Attention: A Context-Aware Alignment Function for Machine Reading

Figure 4 for Sequential Attention: A Context-Aware Alignment Function for Machine Reading

Abstract:In this paper we propose a neural network model with a novel Sequential Attention layer that extends soft attention by assigning weights to words in an input sequence in a way that takes into account not just how well that word matches a query, but how well surrounding words match. We evaluate this approach on the task of reading comprehension (on the Who did What and CNN datasets) and show that it dramatically improves a strong baseline--the Stanford Reader--and is competitive with the state of the art.

* To appear in ACL 2017 2nd Workshop on Representation Learning for NLP. Contains additional experiments in section 4 and a revised Figure 1

Via

Access Paper or Ask Questions