Picture for Xuming Hu

Xuming Hu

Exploring Response Uncertainty in MLLMs: An Empirical Evaluation under Misleading Scenarios

Add code
Nov 05, 2024
Viaarxiv icon

Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models

Add code
Oct 29, 2024
Viaarxiv icon

NeuGPT: Unified multi-modal Neural GPT

Add code
Oct 28, 2024
Viaarxiv icon

MINER: Mining the Underlying Pattern of Modality-Specific Neurons in Multimodal Large Language Models

Add code
Oct 07, 2024
Viaarxiv icon

Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality

Add code
Oct 07, 2024
Figure 1 for Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality
Figure 2 for Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality
Figure 3 for Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality
Figure 4 for Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality
Viaarxiv icon

ErrorRadar: Benchmarking Complex Mathematical Reasoning of Multimodal Large Language Models Via Error Detection

Add code
Oct 06, 2024
Figure 1 for ErrorRadar: Benchmarking Complex Mathematical Reasoning of Multimodal Large Language Models Via Error Detection
Figure 2 for ErrorRadar: Benchmarking Complex Mathematical Reasoning of Multimodal Large Language Models Via Error Detection
Figure 3 for ErrorRadar: Benchmarking Complex Mathematical Reasoning of Multimodal Large Language Models Via Error Detection
Figure 4 for ErrorRadar: Benchmarking Complex Mathematical Reasoning of Multimodal Large Language Models Via Error Detection
Viaarxiv icon

LongGenBench: Long-context Generation Benchmark

Add code
Oct 05, 2024
Viaarxiv icon

Can Watermarked LLMs be Identified by Users via Crafted Prompts?

Add code
Oct 04, 2024
Viaarxiv icon

Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models

Add code
Oct 04, 2024
Viaarxiv icon

DRUPI: Dataset Reduction Using Privileged Information

Add code
Oct 02, 2024
Viaarxiv icon