Cong Yao

VL-Reader: Vision and Language Reconstructor is an Effective Scene Text Recognizer

Sep 18, 2024
Via arXiv

Platypus: A Generalized Specialist Model for Reading Text in Various Forms

Aug 27, 2024
Via arXiv

WebRPG: Automatic Web Rendering Parameters Generation for Visual Presentation

Jul 22, 2024
Via arXiv

Visual Text Generation in the Wild

Jul 19, 2024
Via arXiv

ProcTag: Process Tagging for Assessing the Efficacy of Document Instruction Data

Jul 17, 2024
Via arXiv

LayoutLLM: Layout Instruction Tuning with Large Language Models for Document Understanding

Apr 08, 2024
Via arXiv

OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition

Mar 28, 2024
Via arXiv

HierCode: A Lightweight Hierarchical Codebook for Zero-shot Chinese Text Recognition

Mar 20, 2024
Via arXiv

LORE++: Logical Location Regression Network for Table Structure Recognition with Pre-training

Jan 03, 2024
Via arXiv

FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning

Dec 19, 2023
Via arXiv