Picture for Minxin Nie

Minxin Nie

PIP-MM: Pre-Integrating Prompt Information into Visual Encoding via Existing MLLM Structures

Add code
Oct 30, 2024
Viaarxiv icon