Humen Zhong

CC-OCR: A Comprehensive and Challenging OCR Benchmark for Evaluating Large Multimodal Models in Literacy

Dec 03, 2024, via arXiv

VL-Reader: Vision and Language Reconstructor is an Effective Scene Text Recognizer

Sep 18, 2024, via arXiv

Platypus: A Generalized Specialist Model for Reading Text in Various Forms

Aug 27, 2024, via arXiv

Modeling Entities as Semantic Points for Visual Information Extraction in the Wild

Mar 29, 2023, via arXiv

ARTS: Eliminating Inconsistency between Text Detection and Recognition with Auto-Rectification Text Spotter

Oct 20, 2021, via arXiv

MOST: A Multi-Oriented Scene Text Detector with Localization Refinement

Apr 05, 2021, via arXiv