Picture for Jungong Han

Jungong Han

[CLS] Token Tells Everything Needed for Training-free Efficient MLLMs

Add code
Dec 08, 2024
Viaarxiv icon

PrefixKV: Adaptive Prefix KV Cache is What Vision Instruction-Following Models Need for Efficient Generation

Add code
Dec 04, 2024
Viaarxiv icon

Hybrid Discriminative Attribute-Object Embedding Network for Compositional Zero-Shot Learning

Add code
Nov 28, 2024
Viaarxiv icon

Relation-Aware Meta-Learning for Zero-shot Sketch-Based Image Retrieval

Add code
Nov 28, 2024
Viaarxiv icon

Promptable Anomaly Segmentation with SAM Through Self-Perception Tuning

Add code
Nov 26, 2024
Figure 1 for Promptable Anomaly Segmentation with SAM Through Self-Perception Tuning
Figure 2 for Promptable Anomaly Segmentation with SAM Through Self-Perception Tuning
Figure 3 for Promptable Anomaly Segmentation with SAM Through Self-Perception Tuning
Figure 4 for Promptable Anomaly Segmentation with SAM Through Self-Perception Tuning
Viaarxiv icon

Multi-Stage Knowledge Integration of Vision-Language Models for Continual Learning

Add code
Nov 11, 2024
Figure 1 for Multi-Stage Knowledge Integration of Vision-Language Models for Continual Learning
Figure 2 for Multi-Stage Knowledge Integration of Vision-Language Models for Continual Learning
Figure 3 for Multi-Stage Knowledge Integration of Vision-Language Models for Continual Learning
Figure 4 for Multi-Stage Knowledge Integration of Vision-Language Models for Continual Learning
Viaarxiv icon

A Fresh Look at Generalized Category Discovery through Non-negative Matrix Factorization

Add code
Oct 29, 2024
Figure 1 for A Fresh Look at Generalized Category Discovery through Non-negative Matrix Factorization
Figure 2 for A Fresh Look at Generalized Category Discovery through Non-negative Matrix Factorization
Figure 3 for A Fresh Look at Generalized Category Discovery through Non-negative Matrix Factorization
Figure 4 for A Fresh Look at Generalized Category Discovery through Non-negative Matrix Factorization
Viaarxiv icon

Part-Whole Relational Fusion Towards Multi-Modal Scene Understanding

Add code
Oct 19, 2024
Viaarxiv icon

Context Enhancement with Reconstruction as Sequence for Unified Unsupervised Anomaly Detection

Add code
Sep 10, 2024
Figure 1 for Context Enhancement with Reconstruction as Sequence for Unified Unsupervised Anomaly Detection
Figure 2 for Context Enhancement with Reconstruction as Sequence for Unified Unsupervised Anomaly Detection
Figure 3 for Context Enhancement with Reconstruction as Sequence for Unified Unsupervised Anomaly Detection
Figure 4 for Context Enhancement with Reconstruction as Sequence for Unified Unsupervised Anomaly Detection
Viaarxiv icon

LLMI3D: Empowering LLM with 3D Perception from a Single 2D Image

Add code
Aug 14, 2024
Figure 1 for LLMI3D: Empowering LLM with 3D Perception from a Single 2D Image
Figure 2 for LLMI3D: Empowering LLM with 3D Perception from a Single 2D Image
Figure 3 for LLMI3D: Empowering LLM with 3D Perception from a Single 2D Image
Figure 4 for LLMI3D: Empowering LLM with 3D Perception from a Single 2D Image
Viaarxiv icon