Zhiqi Ge

Unified Generative and Discriminative Training for Multi-modal Large Language Models

Nov 01, 2024
Via arXiv

WorldGPT: Empowering LLM as Multimodal World Model

Apr 28, 2024
Via arXiv

Empowering Vision-Language Models to Follow Interleaved Vision-Language Instructions

Aug 10, 2023
Via arXiv