Picture for Zhuoyao Zhong

Zhuoyao Zhong

Detect-Order-Construct: A Tree Construction based Approach for Hierarchical Document Structure Analysis

Add code
Jan 22, 2024
Viaarxiv icon

Dynamic Relation Transformer for Contextual Text Block Detection

Add code
Jan 17, 2024
Viaarxiv icon

UniVIE: A Unified Label Space Approach to Visual Information Extraction from Form-like Documents

Add code
Jan 17, 2024
Viaarxiv icon

Exploring Predicate Visual Context in Detecting of Human-Object Interactions

Add code
Aug 11, 2023
Viaarxiv icon

A Question-Answering Approach to Key Value Pair Extraction from Form-like Document Images

Add code
Apr 17, 2023
Figure 1 for A Question-Answering Approach to Key Value Pair Extraction from Form-like Document Images
Figure 2 for A Question-Answering Approach to Key Value Pair Extraction from Form-like Document Images
Figure 3 for A Question-Answering Approach to Key Value Pair Extraction from Form-like Document Images
Figure 4 for A Question-Answering Approach to Key Value Pair Extraction from Form-like Document Images
Viaarxiv icon

ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents

Add code
May 25, 2021
Figure 1 for ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents
Figure 2 for ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents
Figure 3 for ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents
Figure 4 for ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents
Viaarxiv icon

ReLaText: Exploiting Visual Relationships for Arbitrary-Shaped Scene Text Detection with Graph Convolutional Networks

Add code
Mar 16, 2020
Figure 1 for ReLaText: Exploiting Visual Relationships for Arbitrary-Shaped Scene Text Detection with Graph Convolutional Networks
Figure 2 for ReLaText: Exploiting Visual Relationships for Arbitrary-Shaped Scene Text Detection with Graph Convolutional Networks
Figure 3 for ReLaText: Exploiting Visual Relationships for Arbitrary-Shaped Scene Text Detection with Graph Convolutional Networks
Figure 4 for ReLaText: Exploiting Visual Relationships for Arbitrary-Shaped Scene Text Detection with Graph Convolutional Networks
Viaarxiv icon

Mask R-CNN with Pyramid Attention Network for Scene Text Detection

Add code
Nov 22, 2018
Figure 1 for Mask R-CNN with Pyramid Attention Network for Scene Text Detection
Figure 2 for Mask R-CNN with Pyramid Attention Network for Scene Text Detection
Figure 3 for Mask R-CNN with Pyramid Attention Network for Scene Text Detection
Figure 4 for Mask R-CNN with Pyramid Attention Network for Scene Text Detection
Viaarxiv icon

An Anchor-Free Region Proposal Network for Faster R-CNN based Text Detection Approaches

Add code
Apr 24, 2018
Figure 1 for An Anchor-Free Region Proposal Network for Faster R-CNN based Text Detection Approaches
Figure 2 for An Anchor-Free Region Proposal Network for Faster R-CNN based Text Detection Approaches
Figure 3 for An Anchor-Free Region Proposal Network for Faster R-CNN based Text Detection Approaches
Figure 4 for An Anchor-Free Region Proposal Network for Faster R-CNN based Text Detection Approaches
Viaarxiv icon

DeepText: A Unified Framework for Text Proposal Generation and Text Detection in Natural Images

Add code
May 24, 2016
Figure 1 for DeepText: A Unified Framework for Text Proposal Generation and Text Detection in Natural Images
Figure 2 for DeepText: A Unified Framework for Text Proposal Generation and Text Detection in Natural Images
Figure 3 for DeepText: A Unified Framework for Text Proposal Generation and Text Detection in Natural Images
Figure 4 for DeepText: A Unified Framework for Text Proposal Generation and Text Detection in Natural Images
Viaarxiv icon