Ashok Popat

ROPE: Reading Order Equivariant Positional Encoding for Graph-based Document Information Extraction

Jun 21, 2021

Chen-Yu Lee, Chun-Liang Li, Chu Wang, Renshen Wang, Yasuhisa Fujii, Siyang Qin, Ashok Popat, Tomas Pfister

Abstract: Natural reading orders of words are crucial for information extraction from form-like documents. Despite recent advances in Graph Convolutional Networks (GCNs) on modeling spatial layout patterns of documents, they have limited ability to capture reading orders of given word-level node representations in a graph. We propose Reading Order Equivariant Positional Encoding (ROPE), a new positional encoding technique designed to apprehend the sequential presentation of words in documents. ROPE generates unique reading order codes for neighboring words relative to the target word given a word-level graph connectivity. We study two fundamental document entity extraction tasks including word labeling and word grouping on the public FUNSD dataset and a large-scale payment dataset. We show that ROPE consistently improves existing GCNs with a margin up to 8.4% F1-score.

* Accepted to ACL-IJCNLP 2021 (Oral)
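
The abstract says ROPE assigns reading order codes to a target word's graph neighbors, relative to that target, but does not spell out the computation. The following is a minimal sketch of one plausible reading, not the authors' implementation: it assumes each word has a position in the input reading order, sorts the target together with its neighbors by that position, and gives each neighbor its signed rank offset from the target. The function name `reading_order_codes`, the `order` mapping, and the toy words are all hypothetical.

```python
def reading_order_codes(target, neighbors, order):
    """Return a signed rank for each neighbor, relative to the target word.

    `order` maps a word id to its position in the input reading order
    (an assumed input; the abstract does not define this interface).
    """
    # Sort the target and its neighbors by reading order.
    group = sorted([target, *neighbors], key=lambda w: order[w])
    pivot = group.index(target)
    # Neighbors read before the target get negative codes, later ones positive.
    return {w: i - pivot for i, w in enumerate(group) if w != target}


# Toy example: four words from a form, with "No." as the target word.
order = {"Invoice": 0, "No.": 1, "12345": 2, "Date": 3}
neighbors = ["Invoice", "12345", "Date"]
codes = reading_order_codes("No.", neighbors, order)

# Shifting every reading-order index by a constant leaves the codes unchanged,
# since only the relative order within the neighborhood matters.
shifted = {w: p + 100 for w, p in order.items()}
codes_shifted = reading_order_codes("No.", neighbors, shifted)
```

Because the codes depend only on the relative order of words inside the neighborhood, they stay the same when the absolute reading-order indices change but the ordering is preserved, which is the property the sketch is meant to illustrate.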
