Picture for Xin Zou

Xin Zou

Capturing Nuanced Preferences: Preference-Aligned Distillation for Small Language Models

Add code
Feb 20, 2025
Viaarxiv icon

RealRAG: Retrieval-augmented Realistic Image Generation via Self-reflective Contrastive Learning

Add code
Feb 02, 2025
Figure 1 for RealRAG: Retrieval-augmented Realistic Image Generation via Self-reflective Contrastive Learning
Figure 2 for RealRAG: Retrieval-augmented Realistic Image Generation via Self-reflective Contrastive Learning
Figure 3 for RealRAG: Retrieval-augmented Realistic Image Generation via Self-reflective Contrastive Learning
Figure 4 for RealRAG: Retrieval-augmented Realistic Image Generation via Self-reflective Contrastive Learning
Viaarxiv icon

MarsSQE: Stereo Quality Enhancement for Martian Images Using Bi-level Cross-view Attention

Add code
Dec 30, 2024
Figure 1 for MarsSQE: Stereo Quality Enhancement for Martian Images Using Bi-level Cross-view Attention
Figure 2 for MarsSQE: Stereo Quality Enhancement for Martian Images Using Bi-level Cross-view Attention
Figure 3 for MarsSQE: Stereo Quality Enhancement for Martian Images Using Bi-level Cross-view Attention
Figure 4 for MarsSQE: Stereo Quality Enhancement for Martian Images Using Bi-level Cross-view Attention
Viaarxiv icon

Trusted Mamba Contrastive Network for Multi-View Clustering

Add code
Dec 21, 2024
Figure 1 for Trusted Mamba Contrastive Network for Multi-View Clustering
Figure 2 for Trusted Mamba Contrastive Network for Multi-View Clustering
Figure 3 for Trusted Mamba Contrastive Network for Multi-View Clustering
Figure 4 for Trusted Mamba Contrastive Network for Multi-View Clustering
Viaarxiv icon

Exploring Response Uncertainty in MLLMs: An Empirical Evaluation under Misleading Scenarios

Add code
Nov 05, 2024
Figure 1 for Exploring Response Uncertainty in MLLMs: An Empirical Evaluation under Misleading Scenarios
Figure 2 for Exploring Response Uncertainty in MLLMs: An Empirical Evaluation under Misleading Scenarios
Figure 3 for Exploring Response Uncertainty in MLLMs: An Empirical Evaluation under Misleading Scenarios
Figure 4 for Exploring Response Uncertainty in MLLMs: An Empirical Evaluation under Misleading Scenarios
Viaarxiv icon

Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality

Add code
Oct 07, 2024
Figure 1 for Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality
Figure 2 for Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality
Figure 3 for Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality
Figure 4 for Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality
Viaarxiv icon

Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models

Add code
Oct 04, 2024
Figure 1 for Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models
Figure 2 for Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models
Figure 3 for Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models
Figure 4 for Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models
Viaarxiv icon

Multimodal Laryngoscopic Video Analysis for Assisted Diagnosis of Vocal Cord Paralysis

Add code
Sep 05, 2024
Figure 1 for Multimodal Laryngoscopic Video Analysis for Assisted Diagnosis of Vocal Cord Paralysis
Figure 2 for Multimodal Laryngoscopic Video Analysis for Assisted Diagnosis of Vocal Cord Paralysis
Figure 3 for Multimodal Laryngoscopic Video Analysis for Assisted Diagnosis of Vocal Cord Paralysis
Figure 4 for Multimodal Laryngoscopic Video Analysis for Assisted Diagnosis of Vocal Cord Paralysis
Viaarxiv icon

Reefknot: A Comprehensive Benchmark for Relation Hallucination Evaluation, Analysis and Mitigation in Multimodal Large Language Models

Add code
Aug 18, 2024
Figure 1 for Reefknot: A Comprehensive Benchmark for Relation Hallucination Evaluation, Analysis and Mitigation in Multimodal Large Language Models
Figure 2 for Reefknot: A Comprehensive Benchmark for Relation Hallucination Evaluation, Analysis and Mitigation in Multimodal Large Language Models
Figure 3 for Reefknot: A Comprehensive Benchmark for Relation Hallucination Evaluation, Analysis and Mitigation in Multimodal Large Language Models
Figure 4 for Reefknot: A Comprehensive Benchmark for Relation Hallucination Evaluation, Analysis and Mitigation in Multimodal Large Language Models
Viaarxiv icon

MarsQE: Semantic-Informed Quality Enhancement for Compressed Martian Image

Add code
Apr 15, 2024
Viaarxiv icon