Picture for Shangbang Long

Shangbang Long

PaliGemma 2: A Family of Versatile VLMs for Transfer

Add code
Dec 04, 2024
Figure 1 for PaliGemma 2: A Family of Versatile VLMs for Transfer
Figure 2 for PaliGemma 2: A Family of Versatile VLMs for Transfer
Figure 3 for PaliGemma 2: A Family of Versatile VLMs for Transfer
Figure 4 for PaliGemma 2: A Family of Versatile VLMs for Transfer
Viaarxiv icon

Evaluating Durability: Benchmark Insights into Multimodal Watermarking

Add code
Jun 06, 2024
Viaarxiv icon

Hierarchical Text Spotter for Joint Text Spotting and Layout Analysis

Add code
Oct 25, 2023
Figure 1 for Hierarchical Text Spotter for Joint Text Spotting and Layout Analysis
Figure 2 for Hierarchical Text Spotter for Joint Text Spotting and Layout Analysis
Figure 3 for Hierarchical Text Spotter for Joint Text Spotting and Layout Analysis
Figure 4 for Hierarchical Text Spotter for Joint Text Spotting and Layout Analysis
Viaarxiv icon

ICDAR 2023 Competition on Hierarchical Text Detection and Recognition

Add code
May 16, 2023
Viaarxiv icon

FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction

Add code
May 04, 2023
Figure 1 for FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction
Figure 2 for FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction
Figure 3 for FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction
Figure 4 for FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction
Viaarxiv icon

Towards End-to-End Unified Scene Text Detection and Layout Analysis

Add code
Mar 28, 2022
Figure 1 for Towards End-to-End Unified Scene Text Detection and Layout Analysis
Figure 2 for Towards End-to-End Unified Scene Text Detection and Layout Analysis
Figure 3 for Towards End-to-End Unified Scene Text Detection and Layout Analysis
Figure 4 for Towards End-to-End Unified Scene Text Detection and Layout Analysis
Viaarxiv icon

UnrealText: Synthesizing Realistic Scene Text Images from the Unreal World

Add code
Mar 24, 2020
Figure 1 for UnrealText: Synthesizing Realistic Scene Text Images from the Unreal World
Figure 2 for UnrealText: Synthesizing Realistic Scene Text Images from the Unreal World
Figure 3 for UnrealText: Synthesizing Realistic Scene Text Images from the Unreal World
Figure 4 for UnrealText: Synthesizing Realistic Scene Text Images from the Unreal World
Viaarxiv icon

A New Perspective for Flexible Feature Gathering in Scene Text Recognition Via Character Anchor Pooling

Add code
Feb 10, 2020
Figure 1 for A New Perspective for Flexible Feature Gathering in Scene Text Recognition Via Character Anchor Pooling
Figure 2 for A New Perspective for Flexible Feature Gathering in Scene Text Recognition Via Character Anchor Pooling
Figure 3 for A New Perspective for Flexible Feature Gathering in Scene Text Recognition Via Character Anchor Pooling
Figure 4 for A New Perspective for Flexible Feature Gathering in Scene Text Recognition Via Character Anchor Pooling
Viaarxiv icon

Alchemy: Techniques for Rectification Based Irregular Scene Text Recognition

Add code
Aug 30, 2019
Figure 1 for Alchemy: Techniques for Rectification Based Irregular Scene Text Recognition
Figure 2 for Alchemy: Techniques for Rectification Based Irregular Scene Text Recognition
Figure 3 for Alchemy: Techniques for Rectification Based Irregular Scene Text Recognition
Figure 4 for Alchemy: Techniques for Rectification Based Irregular Scene Text Recognition
Viaarxiv icon

SynthText3D: Synthesizing Scene Text Images from 3D Virtual Worlds

Add code
Jul 13, 2019
Figure 1 for SynthText3D: Synthesizing Scene Text Images from 3D Virtual Worlds
Figure 2 for SynthText3D: Synthesizing Scene Text Images from 3D Virtual Worlds
Figure 3 for SynthText3D: Synthesizing Scene Text Images from 3D Virtual Worlds
Figure 4 for SynthText3D: Synthesizing Scene Text Images from 3D Virtual Worlds
Viaarxiv icon