Zhi Yu

National Mobile Communications Research Laboratory, Southeast University, Nanjing, China

One Head Eight Arms: Block Matrix based Low Rank Adaptation for CLIP-based Few-Shot Learning

Jan 28, 2025

Wireless Communication with Flexible Reflector: Joint Placement and Rotation Optimization for Coverage Enhancement

Dec 25, 2024

Is Cognition consistent with Perception? Assessing and Mitigating Multimodal Knowledge Conflicts in Document Understanding

Nov 12, 2024

SAM-SP: Self-Prompting Makes SAM Great Again

Aug 22, 2024

WebRPG: Automatic Web Rendering Parameters Generation for Visual Presentation

Jul 22, 2024

ProcTag: Process Tagging for Assessing the Efficacy of Document Instruction Data

Jul 17, 2024

LayoutLLM: Layout Instruction Tuning with Large Language Models for Document Understanding

Apr 08, 2024

Less is More: A Closer Look at Multi-Modal Few-Shot Learning

Jan 10, 2024

LORE++: Logical Location Regression Network for Table Structure Recognition with Pre-training

Jan 03, 2024

Multi-View Fusion and Distillation for Subgrade Distresses Detection based on 3D-GPR

Aug 09, 2023