Hanning Chen

LVLM_CSP: Accelerating Large Vision Language Models via Clustering, Scattering, and Pruning for Reasoning Segmentation

Apr 15, 2025
Via arXiv

Can Multimodal Large Language Models be Guided to Improve Industrial Anomaly Detection?

Jan 27, 2025
Via arXiv

Tell Me What to Track: Infusing Robust Language Guidance for Enhanced Referring Multi-Object Tracking

Dec 17, 2024
Via arXiv

Expanding Event Modality Applications through a Robust CLIP-Based Encoder

Dec 04, 2024
Via arXiv

VLTP: Vision-Language Guided Token Pruning for Task-Oriented Segmentation

Sep 13, 2024
Via arXiv

Promoting Fairness in Link Prediction with Graph Enhancement

Sep 13, 2024
Via arXiv

Recoverable Anonymization for Pose Estimation: A Privacy-Enhancing Approach

Sep 01, 2024
Via arXiv

Vision Language Model for Interpretable and Fine-grained Detection of Safety Compliance in Diverse Workplaces

Aug 13, 2024
Via arXiv

EcoSense: Energy-Efficient Intelligent Sensing for In-Shore Ship Detection through Edge-Cloud Collaboration

Mar 26, 2024
Via arXiv

TaskCLIP: Extend Large Vision-Language Model for Task Oriented Object Detection

Mar 12, 2024
Via arXiv