Picture for YiQing Cai

YiQing Cai

Advancing Fine-Grained Visual Understanding with Multi-Scale Alignment in Multi-Modal Models

Add code
Nov 14, 2024
Viaarxiv icon

UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model

Add code
Aug 05, 2024
Viaarxiv icon