Cross Modal Information Retrieval


VILLAIN at AVerImaTeC: Verifying Image-Text Claims via Multi-Agent Collaboration

Add code
Feb 04, 2026
Viaarxiv icon

Toward Effective Multimodal Graph Foundation Model: A Divide-and-Conquer Based Approach

Add code
Feb 04, 2026
Viaarxiv icon

ReCALL: Recalibrating Capability Degradation for MLLM-based Composed Image Retrieval

Add code
Feb 02, 2026
Viaarxiv icon

CoVA: Text-Guided Composed Video Retrieval for Audio-Visual Content

Add code
Jan 30, 2026
Viaarxiv icon

When Vision Meets Texts in Listwise Reranking

Add code
Jan 28, 2026
Viaarxiv icon

Agentic Very Long Video Understanding

Add code
Jan 26, 2026
Viaarxiv icon

Enginuity: Building an Open Multi-Domain Dataset of Complex Engineering Diagrams

Add code
Jan 19, 2026
Viaarxiv icon

RAG-GFM: Overcoming In-Memory Bottlenecks in Graph Foundation Models via Retrieval-Augmented Generation

Add code
Jan 24, 2026
Viaarxiv icon

Overcoming In-Memory Bottlenecks in Graph Foundation Models via Retrieval-Augmented Generation

Add code
Jan 21, 2026
Viaarxiv icon

Auditory Brain Passage Retrieval: Cross-Sensory EEG Training for Neural Information Retrieval

Add code
Jan 20, 2026
Viaarxiv icon