Picture for Zhifei Xie

Zhifei Xie

Mini-Omni2: Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities

Add code
Oct 16, 2024
Viaarxiv icon

Mini-Omni2: Towards Open-source GPT-4o Model with Vision, Speech and Duplex

Add code
Oct 15, 2024
Viaarxiv icon

Mini-Omni: Language Models Can Hear, Talk While Thinking in Streaming

Add code
Aug 30, 2024
Viaarxiv icon

DreamFactory: Pioneering Multi-Scene Long Video Generation with a Multi-Agent Framework

Add code
Aug 21, 2024
Viaarxiv icon