Picture for Yizhang Jin

Yizhang Jin

LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial Description

Add code
Aug 09, 2024
Viaarxiv icon

Efficient Multimodal Large Language Models: A Survey

Add code
May 17, 2024
Viaarxiv icon

Generalized Category Discovery in Semantic Segmentation

Add code
Nov 20, 2023
Viaarxiv icon