Yijuan Lu

Diffusion-based Document Layout Generation

Mar 19, 2023
Via arXiv

Understanding Long Documents with Different Position-Aware Attentions

Aug 17, 2022
Via arXiv

Improving Structured Text Recognition with Regular Expression Biasing

Nov 10, 2021
Via arXiv

TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models

Sep 25, 2021
Via arXiv

LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document Understanding

Apr 18, 2021
Via arXiv

LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding

Dec 29, 2020
Via arXiv

TAP: Text-Aware Pre-training for Text-VQA and Text-Caption

Dec 08, 2020
Via arXiv

Multi-Scale 2D Temporal Adjacent Networks for Moment Localization with Natural Language

Dec 04, 2020
Via arXiv

Shape retrieval of non-rigid 3D human models

Mar 01, 2020
Via arXiv

FANet: Quality-Aware Feature Aggregation Network for RGB-T Tracking

Nov 24, 2018
Via arXiv