Picture for Guozhi Tang

Guozhi Tang

MCTBench: Multimodal Cognition towards Text-Rich Visual Scenes Benchmark

Add code
Oct 15, 2024
Viaarxiv icon

ParGo: Bridging Vision-Language with Partial and Global Views

Add code
Aug 23, 2024
Viaarxiv icon

Bi-VLDoc: Bidirectional Vision-Language Modeling for Visually-Rich Document Understanding

Add code
Jun 27, 2022
Figure 1 for Bi-VLDoc: Bidirectional Vision-Language Modeling for Visually-Rich Document Understanding
Figure 2 for Bi-VLDoc: Bidirectional Vision-Language Modeling for Visually-Rich Document Understanding
Figure 3 for Bi-VLDoc: Bidirectional Vision-Language Modeling for Visually-Rich Document Understanding
Figure 4 for Bi-VLDoc: Bidirectional Vision-Language Modeling for Visually-Rich Document Understanding
Viaarxiv icon

MatchVIE: Exploiting Match Relevancy between Entities for Visual Information Extraction

Add code
Jun 24, 2021
Figure 1 for MatchVIE: Exploiting Match Relevancy between Entities for Visual Information Extraction
Figure 2 for MatchVIE: Exploiting Match Relevancy between Entities for Visual Information Extraction
Figure 3 for MatchVIE: Exploiting Match Relevancy between Entities for Visual Information Extraction
Figure 4 for MatchVIE: Exploiting Match Relevancy between Entities for Visual Information Extraction
Viaarxiv icon

Tag, Copy or Predict: A Unified Weakly-Supervised Learning Framework for Visual Information Extraction using Sequences

Add code
Jun 20, 2021
Figure 1 for Tag, Copy or Predict: A Unified Weakly-Supervised Learning Framework for Visual Information Extraction using Sequences
Figure 2 for Tag, Copy or Predict: A Unified Weakly-Supervised Learning Framework for Visual Information Extraction using Sequences
Figure 3 for Tag, Copy or Predict: A Unified Weakly-Supervised Learning Framework for Visual Information Extraction using Sequences
Figure 4 for Tag, Copy or Predict: A Unified Weakly-Supervised Learning Framework for Visual Information Extraction using Sequences
Viaarxiv icon

Towards Robust Visual Information Extraction in Real World: New Dataset and Novel Solution

Add code
Jan 24, 2021
Figure 1 for Towards Robust Visual Information Extraction in Real World: New Dataset and Novel Solution
Figure 2 for Towards Robust Visual Information Extraction in Real World: New Dataset and Novel Solution
Figure 3 for Towards Robust Visual Information Extraction in Real World: New Dataset and Novel Solution
Figure 4 for Towards Robust Visual Information Extraction in Real World: New Dataset and Novel Solution
Viaarxiv icon