Picture for Zelun Zhang

Zelun Zhang

ERNIE 5.0 Technical Report

Add code
Feb 04, 2026
Viaarxiv icon

PaddleOCR-VL-1.5: Towards a Multi-Task 0.9B VLM for Robust In-the-Wild Document Parsing

Add code
Jan 29, 2026
Viaarxiv icon

PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

Add code
Oct 16, 2025
Viaarxiv icon

TBGC: Task-level Backbone-Oriented Gradient Clip for Multi-Task Foundation Model Learning

Add code
Jul 07, 2023
Figure 1 for TBGC: Task-level Backbone-Oriented Gradient Clip for Multi-Task Foundation Model Learning
Figure 2 for TBGC: Task-level Backbone-Oriented Gradient Clip for Multi-Task Foundation Model Learning
Figure 3 for TBGC: Task-level Backbone-Oriented Gradient Clip for Multi-Task Foundation Model Learning
Figure 4 for TBGC: Task-level Backbone-Oriented Gradient Clip for Multi-Task Foundation Model Learning
Viaarxiv icon