Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Manuel Carbonell

TreyNet: A Neural Model for Text Localization, Transcription and Named Entity Recognition in Full Pages

Dec 20, 2019

Manuel Carbonell, Alicia Fornés, Mauricio Villegas, Josep Lladós

Figure 1 for TreyNet: A Neural Model for Text Localization, Transcription and Named Entity Recognition in Full Pages

Figure 2 for TreyNet: A Neural Model for Text Localization, Transcription and Named Entity Recognition in Full Pages

Figure 3 for TreyNet: A Neural Model for Text Localization, Transcription and Named Entity Recognition in Full Pages

Figure 4 for TreyNet: A Neural Model for Text Localization, Transcription and Named Entity Recognition in Full Pages

Abstract:In the last years, the consolidation of deep neural network architectures for information extraction in document images has brought big improvements in the performance of each of the tasks involved in this process, consisting of text localization, transcription, and named entity recognition. However, this process is traditionally performed with separate methods for each task. In this work we propose an end-to-end model that jointly performs handwritten text detection, transcription, and named entity recognition at page level, capable of benefiting from shared features for these tasks. We exhaustively evaluate our approach on different datasets, discussing its advantages and limitations compared to sequential approaches.

* Submitted to Pattern Recognition Letters

Via

Access Paper or Ask Questions

Joint Recognition of Handwritten Text and Named Entities with a Neural End-to-end Model

Mar 22, 2018

Manuel Carbonell, Mauricio Villegas, Alicia Fornés, Josep Lladós

Figure 1 for Joint Recognition of Handwritten Text and Named Entities with a Neural End-to-end Model

Figure 2 for Joint Recognition of Handwritten Text and Named Entities with a Neural End-to-end Model

Figure 3 for Joint Recognition of Handwritten Text and Named Entities with a Neural End-to-end Model

Figure 4 for Joint Recognition of Handwritten Text and Named Entities with a Neural End-to-end Model

Abstract:When extracting information from handwritten documents, text transcription and named entity recognition are usually faced as separate subsequent tasks. This has the disadvantage that errors in the first module affect heavily the performance of the second module. In this work we propose to do both tasks jointly, using a single neural network with a common architecture used for plain text recognition. Experimentally, the work has been tested on a collection of historical marriage records. Results of experiments are presented to show the effect on the performance for different configurations: different ways of encoding the information, doing or not transfer learning and processing at text line or multi-line region level. The results are comparable to state of the art reported in the ICDAR 2017 Information Extraction competition, even though the proposed technique does not use any dictionaries, language modeling or post processing.

* To appear in IAPR International Workshop on Document Analysis Systems 2018 (DAS 2018)

Via

Access Paper or Ask Questions