Picture for Mingkun Yang

Mingkun Yang

CC-OCR: A Comprehensive and Challenging OCR Benchmark for Evaluating Large Multimodal Models in Literacy

Add code
Dec 03, 2024
Viaarxiv icon

Sequential Visual and Semantic Consistency for Semi-supervised Text Recognition

Add code
Feb 24, 2024
Viaarxiv icon

Class-Aware Mask-Guided Feature Refinement for Scene Text Recognition

Add code
Feb 21, 2024
Viaarxiv icon

Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution

Add code
May 12, 2023
Viaarxiv icon

Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition

Add code
Jul 01, 2022
Figure 1 for Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition
Figure 2 for Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition
Figure 3 for Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition
Figure 4 for Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition
Viaarxiv icon

DeepAVO: Efficient Pose Refining with Feature Distilling for Deep Visual Odometry

Add code
May 20, 2021
Figure 1 for DeepAVO: Efficient Pose Refining with Feature Distilling for Deep Visual Odometry
Figure 2 for DeepAVO: Efficient Pose Refining with Feature Distilling for Deep Visual Odometry
Figure 3 for DeepAVO: Efficient Pose Refining with Feature Distilling for Deep Visual Odometry
Figure 4 for DeepAVO: Efficient Pose Refining with Feature Distilling for Deep Visual Odometry
Viaarxiv icon

Scene Text Retrieval via Joint Text Detection and Similarity Learning

Add code
Apr 04, 2021
Figure 1 for Scene Text Retrieval via Joint Text Detection and Similarity Learning
Figure 2 for Scene Text Retrieval via Joint Text Detection and Similarity Learning
Figure 3 for Scene Text Retrieval via Joint Text Detection and Similarity Learning
Figure 4 for Scene Text Retrieval via Joint Text Detection and Similarity Learning
Viaarxiv icon

Efficient Backbone Search for Scene Text Recognition

Add code
Mar 14, 2020
Figure 1 for Efficient Backbone Search for Scene Text Recognition
Figure 2 for Efficient Backbone Search for Scene Text Recognition
Figure 3 for Efficient Backbone Search for Scene Text Recognition
Figure 4 for Efficient Backbone Search for Scene Text Recognition
Viaarxiv icon

ICDAR 2019 Robust Reading Challenge on Reading Chinese Text on Signboard

Add code
Dec 20, 2019
Figure 1 for ICDAR 2019 Robust Reading Challenge on Reading Chinese Text on Signboard
Figure 2 for ICDAR 2019 Robust Reading Challenge on Reading Chinese Text on Signboard
Figure 3 for ICDAR 2019 Robust Reading Challenge on Reading Chinese Text on Signboard
Figure 4 for ICDAR 2019 Robust Reading Challenge on Reading Chinese Text on Signboard
Viaarxiv icon

All You Need Is Boundary: Toward Arbitrary-Shaped Text Spotting

Add code
Nov 21, 2019
Figure 1 for All You Need Is Boundary: Toward Arbitrary-Shaped Text Spotting
Figure 2 for All You Need Is Boundary: Toward Arbitrary-Shaped Text Spotting
Figure 3 for All You Need Is Boundary: Toward Arbitrary-Shaped Text Spotting
Figure 4 for All You Need Is Boundary: Toward Arbitrary-Shaped Text Spotting
Viaarxiv icon