Picture for Yue Zhang

Yue Zhang

Renmin University of China

PP-OCRv5: A Specialized 5M-Parameter Model Rivaling Billion-Parameter Vision-Language Models on OCR Tasks

Add code
Mar 25, 2026
Viaarxiv icon

Boosting Document Parsing Efficiency and Performance with Coarse-to-Fine Visual Processing

Add code
Mar 25, 2026
Viaarxiv icon

TimeWeaver: Age-Consistent Reference-Based Face Restoration with Identity Preservation

Add code
Mar 24, 2026
Viaarxiv icon

MeInTime: Bridging Age Gap in Identity-Preserving Face Restoration

Add code
Mar 19, 2026
Viaarxiv icon

BeamAgent: LLM-Aided MIMO Beamforming with Decoupled Intent Parsing and Alternating Optimization for Joint Site Selection and Precoding

Add code
Mar 19, 2026
Viaarxiv icon

Gym-V: A Unified Vision Environment System for Agentic Vision Research

Add code
Mar 17, 2026
Viaarxiv icon

V-Co: A Closer Look at Visual Representation Alignment via Co-Denoising

Add code
Mar 17, 2026
Viaarxiv icon

A Skill-augmented Agentic Framework and Benchmark for Multi-Video Understanding

Add code
Mar 16, 2026
Viaarxiv icon

VisionCoach: Reinforcing Grounded Video Reasoning via Visual-Perception Prompting

Add code
Mar 15, 2026
Viaarxiv icon

Federated Hierarchical Clustering with Automatic Selection of Optimal Cluster Numbers

Add code
Mar 13, 2026
Viaarxiv icon