Picture for Wenqing Cheng

Wenqing Cheng

Generative Compositor for Few-Shot Visual Information Extraction

Add code
Mar 21, 2025
Viaarxiv icon

OmniParser V2: Structured-Points-of-Thought for Unified Visual Text Parsing and Its Generality to Multimodal Large Language Models

Add code
Feb 22, 2025
Viaarxiv icon

VL-Reader: Vision and Language Reconstructor is an Effective Scene Text Recognizer

Add code
Sep 18, 2024
Viaarxiv icon

OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition

Add code
Mar 28, 2024
Viaarxiv icon

DD-RobustBench: An Adversarial Robustness Benchmark for Dataset Distillation

Add code
Mar 20, 2024
Figure 1 for DD-RobustBench: An Adversarial Robustness Benchmark for Dataset Distillation
Figure 2 for DD-RobustBench: An Adversarial Robustness Benchmark for Dataset Distillation
Figure 3 for DD-RobustBench: An Adversarial Robustness Benchmark for Dataset Distillation
Figure 4 for DD-RobustBench: An Adversarial Robustness Benchmark for Dataset Distillation
Viaarxiv icon

Modeling Entities as Semantic Points for Visual Information Extraction in the Wild

Add code
Mar 29, 2023
Figure 1 for Modeling Entities as Semantic Points for Visual Information Extraction in the Wild
Figure 2 for Modeling Entities as Semantic Points for Visual Information Extraction in the Wild
Figure 3 for Modeling Entities as Semantic Points for Visual Information Extraction in the Wild
Figure 4 for Modeling Entities as Semantic Points for Visual Information Extraction in the Wild
Viaarxiv icon

Vision-Language Pre-Training for Boosting Scene Text Detectors

Add code
Apr 29, 2022
Figure 1 for Vision-Language Pre-Training for Boosting Scene Text Detectors
Figure 2 for Vision-Language Pre-Training for Boosting Scene Text Detectors
Figure 3 for Vision-Language Pre-Training for Boosting Scene Text Detectors
Figure 4 for Vision-Language Pre-Training for Boosting Scene Text Detectors
Viaarxiv icon

MOST: A Multi-Oriented Scene Text Detector with Localization Refinement

Add code
Apr 05, 2021
Figure 1 for MOST: A Multi-Oriented Scene Text Detector with Localization Refinement
Figure 2 for MOST: A Multi-Oriented Scene Text Detector with Localization Refinement
Figure 3 for MOST: A Multi-Oriented Scene Text Detector with Localization Refinement
Figure 4 for MOST: A Multi-Oriented Scene Text Detector with Localization Refinement
Viaarxiv icon

Progressive and Aligned Pose Attention Transfer for Person Image Generation

Add code
Mar 22, 2021
Figure 1 for Progressive and Aligned Pose Attention Transfer for Person Image Generation
Figure 2 for Progressive and Aligned Pose Attention Transfer for Person Image Generation
Figure 3 for Progressive and Aligned Pose Attention Transfer for Person Image Generation
Figure 4 for Progressive and Aligned Pose Attention Transfer for Person Image Generation
Viaarxiv icon