Picture for Zhenrong Zhang

Zhenrong Zhang

See then Tell: Enhancing Key Information Extraction with Vision Grounding

Add code
Sep 29, 2024
Figure 1 for See then Tell: Enhancing Key Information Extraction with Vision Grounding
Figure 2 for See then Tell: Enhancing Key Information Extraction with Vision Grounding
Figure 3 for See then Tell: Enhancing Key Information Extraction with Vision Grounding
Figure 4 for See then Tell: Enhancing Key Information Extraction with Vision Grounding
Viaarxiv icon

DocMamba: Efficient Document Pre-training with State Space Model

Add code
Sep 18, 2024
Viaarxiv icon

SRFUND: A Multi-Granularity Hierarchical Structure Reconstruction Benchmark in Form Understanding

Add code
Jun 13, 2024
Viaarxiv icon

Quality-aware Masked Diffusion Transformer for Enhanced Music Generation

Add code
May 24, 2024
Viaarxiv icon

SEMv3: A Fast and Robust Approach to Table Separation Line Detection

Add code
May 20, 2024
Viaarxiv icon

On the Federated Learning Framework for Cooperative Perception

Add code
Apr 26, 2024
Viaarxiv icon

A Dataset and Model for Realistic License Plate Deblurring

Add code
Apr 23, 2024
Figure 1 for A Dataset and Model for Realistic License Plate Deblurring
Figure 2 for A Dataset and Model for Realistic License Plate Deblurring
Figure 3 for A Dataset and Model for Realistic License Plate Deblurring
Figure 4 for A Dataset and Model for Realistic License Plate Deblurring
Viaarxiv icon

Bidirectional Trained Tree-Structured Decoder for Handwritten Mathematical Expression Recognition

Add code
Dec 31, 2023
Viaarxiv icon

LEGO: Learning and Graph-Optimized Modular Tracker for Online Multi-Object Tracking with Point Clouds

Add code
Aug 19, 2023
Viaarxiv icon

Count, Decode and Fetch: A New Approach to Handwritten Chinese Character Error Correction

Add code
Jul 30, 2023
Viaarxiv icon