Picture for Xiaofeng Zhang

Xiaofeng Zhang

DDFAV: Remote Sensing Large Vision Language Models Dataset and Evaluation Benchmark

Add code
Nov 05, 2024
Viaarxiv icon

High-Fidelity Document Stain Removal via A Large-Scale Real-World Dataset and A Memory-Augmented Transformer

Add code
Oct 30, 2024
Viaarxiv icon

GiVE: Guiding Visual Encoder to Perceive Overlooked Information

Add code
Oct 26, 2024
Viaarxiv icon

Bridging the Gap between Text, Audio, Image, and Any Sequence: A Novel Approach using Gloss-based Annotation

Add code
Oct 04, 2024
Viaarxiv icon

Instance-adaptive Zero-shot Chain-of-Thought Prompting

Add code
Sep 30, 2024
Viaarxiv icon

DOPRA: Decoding Over-accumulation Penalization and Re-allocation in Specific Weighting Layer

Add code
Jul 23, 2024
Viaarxiv icon

From Redundancy to Relevance: Enhancing Explainability in Multimodal Large Language Models

Add code
Jun 04, 2024
Viaarxiv icon

Wavelet-Decoupling Contrastive Enhancement Network for Fine-Grained Skeleton-Based Action Recognition

Add code
Feb 03, 2024
Viaarxiv icon

Building Lane-Level Maps from Aerial Images

Add code
Dec 20, 2023
Viaarxiv icon

Enlighten-Your-Voice: When Multimodal Meets Zero-shot Low-light Image Enhancement

Add code
Dec 15, 2023
Viaarxiv icon