Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Text Reading Order in Uncontrolled Conditions by Sparse Graph Segmentation

May 04, 2023

Renshen Wang, Yasuhisa Fujii, Alessandro Bissacco

Figure 1 for Text Reading Order in Uncontrolled Conditions by Sparse Graph Segmentation

Figure 2 for Text Reading Order in Uncontrolled Conditions by Sparse Graph Segmentation

Figure 3 for Text Reading Order in Uncontrolled Conditions by Sparse Graph Segmentation

Figure 4 for Text Reading Order in Uncontrolled Conditions by Sparse Graph Segmentation

Share this with someone who'll enjoy it:

Abstract:Text reading order is a crucial aspect in the output of an OCR engine, with a large impact on downstream tasks. Its difficulty lies in the large variation of domain specific layout structures, and is further exacerbated by real-world image degradations such as perspective distortions. We propose a lightweight, scalable and generalizable approach to identify text reading order with a multi-modal, multi-task graph convolutional network (GCN) running on a sparse layout based graph. Predictions from the model provide hints of bidimensional relations among text lines and layout region structures, upon which a post-processing cluster-and-sort algorithm generates an ordered sequence of all the text lines. The model is language-agnostic and runs effectively across multi-language datasets that contain various types of images taken in uncontrolled conditions, and it is small enough to be deployed on virtually any platform including mobile devices.

* Accepted to ICDAR 2023

View paper on

Share this with someone who'll enjoy it:

Title:Text Reading Order in Uncontrolled Conditions by Sparse Graph Segmentation

Paper and Code