Picture for Zhenye Gan

Zhenye Gan

LLaVA-KD: A Framework of Distilling Multimodal Large Language Models

Add code
Oct 21, 2024
Viaarxiv icon

LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial Description

Add code
Aug 09, 2024
Viaarxiv icon

PSPU: Enhanced Positive and Unlabeled Learning by Leveraging Pseudo Supervision

Add code
Jul 09, 2024
Viaarxiv icon

ADer: A Comprehensive Benchmark for Multi-class Visual Anomaly Detection

Add code
Jun 06, 2024
Viaarxiv icon

Efficient Multimodal Large Language Models: A Survey

Add code
May 17, 2024
Viaarxiv icon

MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection

Add code
Apr 14, 2024
Viaarxiv icon

Real-IAD: A Real-World Multi-View Dataset for Benchmarking Versatile Industrial Anomaly Detection

Add code
Mar 19, 2024
Viaarxiv icon

DMAD: Dual Memory Bank for Real-World Anomaly Detection

Add code
Mar 19, 2024
Viaarxiv icon

Hear to Segment: Unmixing the Audio to Guide the Semantic Segmentation

Add code
May 12, 2023
Viaarxiv icon

MixTeacher: Mining Promising Labels with Mixed Scale Teacher for Semi-Supervised Object Detection

Add code
Mar 16, 2023
Viaarxiv icon