Mingze Zhou

WorldGPT: Empowering LLM as Multimodal World Model

Apr 28, 2024
Via arXiv

Cross-modal Prompts: Adapting Large Pre-trained Models for Audio-Visual Downstream Tasks

Nov 09, 2023
Via arXiv