Dinei Florencio

From Characters to Words: Hierarchical Pre-trained Language Model for Open-vocabulary Language Understanding

May 23, 2023
Via arXiv

Diffusion-based Document Layout Generation

Mar 19, 2023
Via arXiv

Understanding Long Documents with Different Position-Aware Attentions

Aug 17, 2022
Via arXiv

Improving Structured Text Recognition with Regular Expression Biasing

Nov 10, 2021
Via arXiv

TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models

Sep 25, 2021
Via arXiv

LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document Understanding

Apr 18, 2021
Via arXiv

LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding

Dec 29, 2020
Via arXiv

TAP: Text-Aware Pre-training for Text-VQA and Text-Caption

Dec 08, 2020
Via arXiv

RePr: Improved Training of Convolutional Filters

Nov 26, 2018
Via arXiv

A Fusion Framework for Camouflaged Moving Foreground Detection in the Wavelet Domain

Apr 16, 2018
Via arXiv