Picture for Leyang Shen

Leyang Shen

LION-FS: Fast & Slow Video-Language Thinker as Online Video Assistant

Add code
Mar 05, 2025
Viaarxiv icon

MoME: Mixture of Multimodal Experts for Generalist Multimodal Large Language Models

Add code
Jul 17, 2024
Viaarxiv icon

LION : Empowering Multimodal Large Language Model with Dual-Level Visual Knowledge

Add code
Nov 26, 2023
Figure 1 for LION : Empowering Multimodal Large Language Model with Dual-Level Visual Knowledge
Figure 2 for LION : Empowering Multimodal Large Language Model with Dual-Level Visual Knowledge
Figure 3 for LION : Empowering Multimodal Large Language Model with Dual-Level Visual Knowledge
Figure 4 for LION : Empowering Multimodal Large Language Model with Dual-Level Visual Knowledge
Viaarxiv icon