Picture for Pengyuan Lyu

Pengyuan Lyu

R-CoT: Reverse Chain-of-Thought Problem Generation for Geometric Reasoning in Large Multimodal Models

Add code
Oct 23, 2024
Viaarxiv icon

WeCromCL: Weakly Supervised Cross-Modality Contrastive Learning for Transcription-only Supervised Text Spotting

Add code
Jul 28, 2024
Viaarxiv icon

StrucTexTv3: An Efficient Vision-Language Model for Text-rich Image Perception, Comprehension, and Beyond

Add code
Jun 04, 2024
Viaarxiv icon

Towards Unified Multi-granularity Text Detection with Interactive Attention

Add code
May 30, 2024
Viaarxiv icon

GridFormer: Towards Accurate Table Structure Recognition via Grid Prediction

Add code
Sep 26, 2023
Figure 1 for GridFormer: Towards Accurate Table Structure Recognition via Grid Prediction
Figure 2 for GridFormer: Towards Accurate Table Structure Recognition via Grid Prediction
Figure 3 for GridFormer: Towards Accurate Table Structure Recognition via Grid Prediction
Figure 4 for GridFormer: Towards Accurate Table Structure Recognition via Grid Prediction
Viaarxiv icon

Towards Robust Real-Time Scene Text Detection: From Semantic to Instance Representation Learning

Add code
Aug 14, 2023
Figure 1 for Towards Robust Real-Time Scene Text Detection: From Semantic to Instance Representation Learning
Figure 2 for Towards Robust Real-Time Scene Text Detection: From Semantic to Instance Representation Learning
Figure 3 for Towards Robust Real-Time Scene Text Detection: From Semantic to Instance Representation Learning
Figure 4 for Towards Robust Real-Time Scene Text Detection: From Semantic to Instance Representation Learning
Viaarxiv icon

ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images

Add code
Jun 05, 2023
Viaarxiv icon

Decoupling Recognition from Detection: Single Shot Self-Reliant Scene Text Spotter

Add code
Jul 18, 2022
Figure 1 for Decoupling Recognition from Detection: Single Shot Self-Reliant Scene Text Spotter
Figure 2 for Decoupling Recognition from Detection: Single Shot Self-Reliant Scene Text Spotter
Figure 3 for Decoupling Recognition from Detection: Single Shot Self-Reliant Scene Text Spotter
Figure 4 for Decoupling Recognition from Detection: Single Shot Self-Reliant Scene Text Spotter
Viaarxiv icon

MaskOCR: Text Recognition with Masked Encoder-Decoder Pretraining

Add code
Jun 01, 2022
Figure 1 for MaskOCR: Text Recognition with Masked Encoder-Decoder Pretraining
Figure 2 for MaskOCR: Text Recognition with Masked Encoder-Decoder Pretraining
Figure 3 for MaskOCR: Text Recognition with Masked Encoder-Decoder Pretraining
Figure 4 for MaskOCR: Text Recognition with Masked Encoder-Decoder Pretraining
Viaarxiv icon

PGNet: Real-time Arbitrarily-Shaped Text Spotting with Point Gathering Network

Add code
Apr 12, 2021
Figure 1 for PGNet: Real-time Arbitrarily-Shaped Text Spotting with Point Gathering Network
Figure 2 for PGNet: Real-time Arbitrarily-Shaped Text Spotting with Point Gathering Network
Figure 3 for PGNet: Real-time Arbitrarily-Shaped Text Spotting with Point Gathering Network
Figure 4 for PGNet: Real-time Arbitrarily-Shaped Text Spotting with Point Gathering Network
Viaarxiv icon