Renshen Wang

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Mar 08, 2024

FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction

May 04, 2023

Text Reading Order in Uncontrolled Conditions by Sparse Graph Segmentation

May 04, 2023

FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction

Mar 24, 2022

Unified Line and Paragraph Detection by Graph Convolutional Networks

Mar 17, 2022

ROPE: Reading Order Equivariant Positional Encoding for Graph-based Document Information Extraction

Jun 21, 2021

General-Purpose OCR Paragraph Identification by Graph Convolutional Neural Networks

Feb 01, 2021