Picture for Zhenye Gan

Zhenye Gan

MobileMamba: Lightweight Multi-Receptive Visual Mamba Network

Add code
Nov 24, 2024
Figure 1 for MobileMamba: Lightweight Multi-Receptive Visual Mamba Network
Figure 2 for MobileMamba: Lightweight Multi-Receptive Visual Mamba Network
Figure 3 for MobileMamba: Lightweight Multi-Receptive Visual Mamba Network
Figure 4 for MobileMamba: Lightweight Multi-Receptive Visual Mamba Network
Viaarxiv icon

LLaVA-KD: A Framework of Distilling Multimodal Large Language Models

Add code
Oct 21, 2024
Viaarxiv icon

LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial Description

Add code
Aug 09, 2024
Viaarxiv icon

PSPU: Enhanced Positive and Unlabeled Learning by Leveraging Pseudo Supervision

Add code
Jul 09, 2024
Viaarxiv icon

ADer: A Comprehensive Benchmark for Multi-class Visual Anomaly Detection

Add code
Jun 06, 2024
Viaarxiv icon

Efficient Multimodal Large Language Models: A Survey

Add code
May 17, 2024
Viaarxiv icon

MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection

Add code
Apr 14, 2024
Figure 1 for MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection
Figure 2 for MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection
Figure 3 for MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection
Figure 4 for MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection
Viaarxiv icon

DMAD: Dual Memory Bank for Real-World Anomaly Detection

Add code
Mar 19, 2024
Viaarxiv icon

Real-IAD: A Real-World Multi-View Dataset for Benchmarking Versatile Industrial Anomaly Detection

Add code
Mar 19, 2024
Viaarxiv icon

Hear to Segment: Unmixing the Audio to Guide the Semantic Segmentation

Add code
May 12, 2023
Viaarxiv icon