Humen Zhong

CC-OCR: A Comprehensive and Challenging OCR Benchmark for Evaluating Large Multimodal Models in Literacy

Dec 03, 2024

VL-Reader: Vision and Language Reconstructor is an Effective Scene Text Recognizer

Sep 18, 2024

Platypus: A Generalized Specialist Model for Reading Text in Various Forms

Aug 27, 2024

Modeling Entities as Semantic Points for Visual Information Extraction in the Wild

Mar 29, 2023

ARTS: Eliminating Inconsistency between Text Detection and Recognition with Auto-Rectification Text Spotter

Oct 20, 2021

MOST: A Multi-Oriented Scene Text Detector with Localization Refinement

Apr 05, 2021