Picture for Xiaofeng Zhang

Xiaofeng Zhang

CMAMRNet: A Contextual Mask-Aware Network Enhancing Mural Restoration Through Comprehensive Mask Guidance

Add code
Aug 10, 2025
Viaarxiv icon

Bias Analysis in Unconditional Image Generative Models

Add code
Jun 10, 2025
Viaarxiv icon

CulturalFrames: Assessing Cultural Expectation Alignment in Text-to-Image Models and Evaluation Metrics

Add code
Jun 10, 2025
Viaarxiv icon

AdaToken-3D: Dynamic Spatial Gating for Efficient 3D Large Multimodal-Models Reasoning

Add code
May 19, 2025
Viaarxiv icon

LensNet: An End-to-End Learning Framework for Empirical Point Spread Function Modeling and Lensless Imaging Reconstruction

Add code
May 03, 2025
Viaarxiv icon

Improving Complex Reasoning with Dynamic Prompt Corruption: A soft prompt Optimization Approach

Add code
Mar 17, 2025
Figure 1 for Improving Complex Reasoning with Dynamic Prompt Corruption: A soft prompt Optimization Approach
Figure 2 for Improving Complex Reasoning with Dynamic Prompt Corruption: A soft prompt Optimization Approach
Figure 3 for Improving Complex Reasoning with Dynamic Prompt Corruption: A soft prompt Optimization Approach
Figure 4 for Improving Complex Reasoning with Dynamic Prompt Corruption: A soft prompt Optimization Approach
Viaarxiv icon

PKRD-CoT: A Unified Chain-of-thought Prompting for Multi-Modal Large Language Models in Autonomous Driving

Add code
Dec 02, 2024
Figure 1 for PKRD-CoT: A Unified Chain-of-thought Prompting for Multi-Modal Large Language Models in Autonomous Driving
Figure 2 for PKRD-CoT: A Unified Chain-of-thought Prompting for Multi-Modal Large Language Models in Autonomous Driving
Figure 3 for PKRD-CoT: A Unified Chain-of-thought Prompting for Multi-Modal Large Language Models in Autonomous Driving
Figure 4 for PKRD-CoT: A Unified Chain-of-thought Prompting for Multi-Modal Large Language Models in Autonomous Driving
Viaarxiv icon

Seeing Clearly by Layer Two: Enhancing Attention Heads to Alleviate Hallucination in LVLMs

Add code
Nov 15, 2024
Figure 1 for Seeing Clearly by Layer Two: Enhancing Attention Heads to Alleviate Hallucination in LVLMs
Figure 2 for Seeing Clearly by Layer Two: Enhancing Attention Heads to Alleviate Hallucination in LVLMs
Figure 3 for Seeing Clearly by Layer Two: Enhancing Attention Heads to Alleviate Hallucination in LVLMs
Figure 4 for Seeing Clearly by Layer Two: Enhancing Attention Heads to Alleviate Hallucination in LVLMs
Viaarxiv icon

DDFAV: Remote Sensing Large Vision Language Models Dataset and Evaluation Benchmark

Add code
Nov 05, 2024
Viaarxiv icon

High-Fidelity Document Stain Removal via A Large-Scale Real-World Dataset and A Memory-Augmented Transformer

Add code
Oct 30, 2024
Viaarxiv icon