Picture for Zhuoyao Zhong

Zhuoyao Zhong

Detect-Order-Construct: A Tree Construction based Approach for Hierarchical Document Structure Analysis

Add code
Jan 22, 2024
Viaarxiv icon

Dynamic Relation Transformer for Contextual Text Block Detection

Add code
Jan 17, 2024
Figure 1 for Dynamic Relation Transformer for Contextual Text Block Detection
Figure 2 for Dynamic Relation Transformer for Contextual Text Block Detection
Figure 3 for Dynamic Relation Transformer for Contextual Text Block Detection
Figure 4 for Dynamic Relation Transformer for Contextual Text Block Detection
Viaarxiv icon

UniVIE: A Unified Label Space Approach to Visual Information Extraction from Form-like Documents

Add code
Jan 17, 2024
Viaarxiv icon

Exploring Predicate Visual Context in Detecting of Human-Object Interactions

Add code
Aug 11, 2023
Viaarxiv icon

A Question-Answering Approach to Key Value Pair Extraction from Form-like Document Images

Add code
Apr 17, 2023
Figure 1 for A Question-Answering Approach to Key Value Pair Extraction from Form-like Document Images
Figure 2 for A Question-Answering Approach to Key Value Pair Extraction from Form-like Document Images
Figure 3 for A Question-Answering Approach to Key Value Pair Extraction from Form-like Document Images
Figure 4 for A Question-Answering Approach to Key Value Pair Extraction from Form-like Document Images
Viaarxiv icon

ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents

Add code
May 25, 2021
Figure 1 for ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents
Figure 2 for ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents
Figure 3 for ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents
Figure 4 for ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents
Viaarxiv icon

ReLaText: Exploiting Visual Relationships for Arbitrary-Shaped Scene Text Detection with Graph Convolutional Networks

Add code
Mar 16, 2020
Figure 1 for ReLaText: Exploiting Visual Relationships for Arbitrary-Shaped Scene Text Detection with Graph Convolutional Networks
Figure 2 for ReLaText: Exploiting Visual Relationships for Arbitrary-Shaped Scene Text Detection with Graph Convolutional Networks
Figure 3 for ReLaText: Exploiting Visual Relationships for Arbitrary-Shaped Scene Text Detection with Graph Convolutional Networks
Figure 4 for ReLaText: Exploiting Visual Relationships for Arbitrary-Shaped Scene Text Detection with Graph Convolutional Networks
Viaarxiv icon

Mask R-CNN with Pyramid Attention Network for Scene Text Detection

Add code
Nov 22, 2018
Figure 1 for Mask R-CNN with Pyramid Attention Network for Scene Text Detection
Figure 2 for Mask R-CNN with Pyramid Attention Network for Scene Text Detection
Figure 3 for Mask R-CNN with Pyramid Attention Network for Scene Text Detection
Figure 4 for Mask R-CNN with Pyramid Attention Network for Scene Text Detection
Viaarxiv icon

An Anchor-Free Region Proposal Network for Faster R-CNN based Text Detection Approaches

Add code
Apr 24, 2018
Figure 1 for An Anchor-Free Region Proposal Network for Faster R-CNN based Text Detection Approaches
Figure 2 for An Anchor-Free Region Proposal Network for Faster R-CNN based Text Detection Approaches
Figure 3 for An Anchor-Free Region Proposal Network for Faster R-CNN based Text Detection Approaches
Figure 4 for An Anchor-Free Region Proposal Network for Faster R-CNN based Text Detection Approaches
Viaarxiv icon

DeepText: A Unified Framework for Text Proposal Generation and Text Detection in Natural Images

Add code
May 24, 2016
Figure 1 for DeepText: A Unified Framework for Text Proposal Generation and Text Detection in Natural Images
Figure 2 for DeepText: A Unified Framework for Text Proposal Generation and Text Detection in Natural Images
Figure 3 for DeepText: A Unified Framework for Text Proposal Generation and Text Detection in Natural Images
Figure 4 for DeepText: A Unified Framework for Text Proposal Generation and Text Detection in Natural Images
Viaarxiv icon