Picture for Wenjun Huang

Wenjun Huang

EMMA: Empowering Multi-modal Mamba with Structural and Hierarchical Alignment

Add code
Oct 08, 2024
Viaarxiv icon

VLTP: Vision-Language Guided Token Pruning for Task-Oriented Segmentation

Add code
Sep 13, 2024
Figure 1 for VLTP: Vision-Language Guided Token Pruning for Task-Oriented Segmentation
Figure 2 for VLTP: Vision-Language Guided Token Pruning for Task-Oriented Segmentation
Figure 3 for VLTP: Vision-Language Guided Token Pruning for Task-Oriented Segmentation
Figure 4 for VLTP: Vision-Language Guided Token Pruning for Task-Oriented Segmentation
Viaarxiv icon

3D-LSPTM: An Automatic Framework with 3D-Large-Scale Pretrained Model for Laryngeal Cancer Detection Using Laryngoscopic Videos

Add code
Sep 02, 2024
Viaarxiv icon

Recoverable Anonymization for Pose Estimation: A Privacy-Enhancing Approach

Add code
Sep 01, 2024
Viaarxiv icon

ML-Mamba: Efficient Multi-Modal Large Language Model Utilizing Mamba-2

Add code
Jul 29, 2024
Viaarxiv icon

EcoSense: Energy-Efficient Intelligent Sensing for In-Shore Ship Detection through Edge-Cloud Collaboration

Add code
Mar 26, 2024
Viaarxiv icon

TaskCLIP: Extend Large Vision-Language Model for Task Oriented Object Detection

Add code
Mar 12, 2024
Viaarxiv icon

HEAL: Brain-inspired Hyperdimensional Efficient Active Learning

Add code
Feb 17, 2024
Viaarxiv icon

A Plug-in Tiny AI Module for Intelligent and Selective Sensor Data Transmission

Add code
Feb 03, 2024
Viaarxiv icon

Practical Probabilistic Model-based Deep Reinforcement Learning by Integrating Dropout Uncertainty and Trajectory Sampling

Add code
Sep 20, 2023
Viaarxiv icon