Picture for Kaihang Pan

Kaihang Pan

Unified Generative and Discriminative Training for Multi-modal Large Language Models

Add code
Nov 01, 2024
Viaarxiv icon

Towards Unified Multimodal Editing with Enhanced Knowledge Collaboration

Add code
Sep 30, 2024
Viaarxiv icon

Auto-Encoding Morph-Tokens for Multimodal LLM

Add code
May 03, 2024
Viaarxiv icon

Improving Vision Anomaly Detection with the Guidance of Language Modality

Add code
Oct 04, 2023
Viaarxiv icon

ControlRetriever: Harnessing the Power of Instructions for Controllable Retrieval

Add code
Aug 19, 2023
Viaarxiv icon

Empowering Vision-Language Models to Follow Interleaved Vision-Language Instructions

Add code
Aug 10, 2023
Viaarxiv icon

Meta-augmented Prompt Tuning for Better Few-shot Learning

Add code
Mar 28, 2023
Viaarxiv icon