Yoonsik Kim

TableVQA-Bench: A Visual Question Answering Benchmark on Multiple Table Domains

Apr 30, 2024
Via arXiv

HyperCLOVA X Technical Report

Apr 13, 2024
Via arXiv

SCOB: Universal Text Understanding via Character-wise Supervised Contrastive Learning with Online Text Rendering for Bridging Domain Gap

Sep 21, 2023
Via arXiv

Cream: Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models

May 24, 2023
Via arXiv

Towards Unified Scene Text Spotting based on Sequence Generation

Apr 07, 2023
Via arXiv

Technical Report on Web-based Visual Corpus Construction for Visual Document Understanding

Nov 07, 2022
Via arXiv

DEER: Detection-agnostic End-to-End Recognizer for Scene Text Spotting

Mar 10, 2022
Via arXiv

Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features

Nov 30, 2021
Via arXiv

RewriteNet: Realistic Scene Text Image Generation via Editing Text in Real-world Image

Jul 23, 2021
Via arXiv

SynthTIGER: Synthetic Text Image GEneratoR Towards Better Text Recognition Models

Jul 20, 2021
Via arXiv