Picture for Xirong Li

Xirong Li

Mitigating Hallucination in Multimodal Large Language Model via Hallucination-targeted Direct Preference Optimization

Add code
Nov 15, 2024
Viaarxiv icon

Beyond Coarse-Grained Matching in Video-Text Retrieval

Add code
Oct 17, 2024
Figure 1 for Beyond Coarse-Grained Matching in Video-Text Retrieval
Figure 2 for Beyond Coarse-Grained Matching in Video-Text Retrieval
Figure 3 for Beyond Coarse-Grained Matching in Video-Text Retrieval
Figure 4 for Beyond Coarse-Grained Matching in Video-Text Retrieval
Viaarxiv icon

Magnifier Prompt: Tackling Multimodal Hallucination via Extremely Simple Instructions

Add code
Oct 15, 2024
Viaarxiv icon

D&M: Enriching E-commerce Videos with Sound Effects by Key Moment Detection and SFX Matching

Add code
Aug 23, 2024
Viaarxiv icon

ASR-enhanced Multimodal Representation Learning for Cross-Domain Product Retrieval

Add code
Aug 06, 2024
Figure 1 for ASR-enhanced Multimodal Representation Learning for Cross-Domain Product Retrieval
Figure 2 for ASR-enhanced Multimodal Representation Learning for Cross-Domain Product Retrieval
Figure 3 for ASR-enhanced Multimodal Representation Learning for Cross-Domain Product Retrieval
Figure 4 for ASR-enhanced Multimodal Representation Learning for Cross-Domain Product Retrieval
Viaarxiv icon

PhD: A Prompted Visual Hallucination Evaluation Dataset

Add code
Mar 17, 2024
Viaarxiv icon

Adaptive Fusion of Radiomics and Deep Features for Lung Adenocarcinoma Subtype Recognition

Add code
Aug 27, 2023
Viaarxiv icon

TeachCLIP: Multi-Grained Teaching for Efficient Text-to-Video Retrieval

Add code
Aug 02, 2023
Viaarxiv icon

Cross-domain Collaborative Learning for Recognizing Multiple Retinal Diseases from Wide-Field Fundus Images

Add code
May 14, 2023
Viaarxiv icon

Renmin University of China at TRECVID 2022: Improving Video Search by Feature Fusion and Negation Understanding

Add code
Nov 28, 2022
Viaarxiv icon