Picture for Jianshu Zhang

Jianshu Zhang

Bridge-Coder: Unlocking LLMs' Potential to Overcome Language Gaps in Low-Resource Code

Add code
Oct 24, 2024
Viaarxiv icon

Personalized Visual Instruction Tuning

Add code
Oct 09, 2024
Figure 1 for Personalized Visual Instruction Tuning
Figure 2 for Personalized Visual Instruction Tuning
Figure 3 for Personalized Visual Instruction Tuning
Figure 4 for Personalized Visual Instruction Tuning
Viaarxiv icon

See then Tell: Enhancing Key Information Extraction with Vision Grounding

Add code
Sep 29, 2024
Figure 1 for See then Tell: Enhancing Key Information Extraction with Vision Grounding
Figure 2 for See then Tell: Enhancing Key Information Extraction with Vision Grounding
Figure 3 for See then Tell: Enhancing Key Information Extraction with Vision Grounding
Figure 4 for See then Tell: Enhancing Key Information Extraction with Vision Grounding
Viaarxiv icon

DocMamba: Efficient Document Pre-training with State Space Model

Add code
Sep 18, 2024
Viaarxiv icon

FIRST: Teach A Reliable Large Language Model Through Efficient Trustworthy Distillation

Add code
Aug 22, 2024
Viaarxiv icon

SRFUND: A Multi-Granularity Hierarchical Structure Reconstruction Benchmark in Form Understanding

Add code
Jun 13, 2024
Viaarxiv icon

Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions

Add code
Jun 11, 2024
Viaarxiv icon

CORE: Mitigating Catastrophic Forgetting in Continual Learning through Cognitive Replay

Add code
Feb 02, 2024
Viaarxiv icon

Count, Decode and Fetch: A New Approach to Handwritten Chinese Character Error Correction

Add code
Jul 30, 2023
Viaarxiv icon

HRDoc: Dataset and Baseline Method Toward Hierarchical Reconstruction of Document Structures

Add code
Mar 24, 2023
Viaarxiv icon