Jing Zhang

The University of Sydney, Australia

PP-OCRv5: A Specialized 5M-Parameter Model Rivaling Billion-Parameter Vision-Language Models on OCR Tasks

Mar 25, 2026, via arXiv

Heuristic-inspired Reasoning Priors Facilitate Data-Efficient Referring Object Detection

Mar 25, 2026, via arXiv

Boosting Document Parsing Efficiency and Performance with Coarse-to-Fine Visual Processing

Mar 25, 2026, via arXiv

Feature Incremental Clustering with Generalization Bounds

Mar 23, 2026, via arXiv

ReconMIL: Synergizing Latent Space Reconstruction with Bi-Stream Mamba for Whole Slide Image Analysis

Mar 20, 2026, via arXiv

Omni-I2C: A Holistic Benchmark for High-Fidelity Image-to-Code Generation

Mar 18, 2026, via arXiv

Action Draft and Verify: A Self-Verifying Framework for Vision-Language-Action Model

Mar 18, 2026, via arXiv

TimeAPN: Adaptive Amplitude-Phase Non-Stationarity Normalization for Time Series Forecasting

Mar 18, 2026, via arXiv

Detecting Fake Reviewer Groups in Dynamic Networks: An Adaptive Graph Learning Method

Mar 09, 2026, via arXiv

VSDiffusion: Taming Ill-Posed Shadow Generation via Visibility-Constrained Diffusion

Mar 09, 2026, via arXiv