Picture for Yizhang Jin

Yizhang Jin

LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial Description

Add code
Aug 09, 2024
Viaarxiv icon

Efficient Multimodal Large Language Models: A Survey

Add code
May 17, 2024
Figure 1 for Efficient Multimodal Large Language Models: A Survey
Figure 2 for Efficient Multimodal Large Language Models: A Survey
Figure 3 for Efficient Multimodal Large Language Models: A Survey
Figure 4 for Efficient Multimodal Large Language Models: A Survey
Viaarxiv icon

Generalized Category Discovery in Semantic Segmentation

Add code
Nov 20, 2023
Viaarxiv icon