Shangbang Long

PaliGemma 2: A Family of Versatile VLMs for Transfer

Dec 04, 2024

Evaluating Durability: Benchmark Insights into Multimodal Watermarking

Jun 06, 2024

Hierarchical Text Spotter for Joint Text Spotting and Layout Analysis

Oct 25, 2023

ICDAR 2023 Competition on Hierarchical Text Detection and Recognition

May 16, 2023

FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction

May 04, 2023

Towards End-to-End Unified Scene Text Detection and Layout Analysis

Mar 28, 2022

UnrealText: Synthesizing Realistic Scene Text Images from the Unreal World

Mar 24, 2020

A New Perspective for Flexible Feature Gathering in Scene Text Recognition Via Character Anchor Pooling

Feb 10, 2020

Alchemy: Techniques for Rectification Based Irregular Scene Text Recognition

Aug 30, 2019

SynthText3D: Synthesizing Scene Text Images from 3D Virtual Worlds

Jul 13, 2019