Picture for Kai Huang

Kai Huang

AirCache: Activating Inter-modal Relevancy KV Cache Compression for Efficient Large Vision-Language Model Inference

Add code
Mar 31, 2025
Viaarxiv icon

Pick-and-place Manipulation Across Grippers Without Retraining: A Learning-optimization Diffusion Policy Approach

Add code
Feb 21, 2025
Viaarxiv icon

A Real-Time System for Scheduling and Managing UAV Delivery in Urban

Add code
Dec 16, 2024
Figure 1 for A Real-Time System for Scheduling and Managing UAV Delivery in Urban
Figure 2 for A Real-Time System for Scheduling and Managing UAV Delivery in Urban
Figure 3 for A Real-Time System for Scheduling and Managing UAV Delivery in Urban
Figure 4 for A Real-Time System for Scheduling and Managing UAV Delivery in Urban
Viaarxiv icon

KaLDeX: Kalman Filter based Linear Deformable Cross Attention for Retina Vessel Segmentation

Add code
Oct 28, 2024
Viaarxiv icon

Extrapolating Prospective Glaucoma Fundus Images through Diffusion Model in Irregular Longitudinal Sequences

Add code
Oct 28, 2024
Viaarxiv icon

Flow-Inspired Lightweight Multi-Robot Real-Time Scheduling Planner

Add code
Sep 11, 2024
Viaarxiv icon

4D-CAT: Synthesis of 4D Coronary Artery Trees from Systole and Diastole

Add code
Sep 03, 2024
Figure 1 for 4D-CAT: Synthesis of 4D Coronary Artery Trees from Systole and Diastole
Figure 2 for 4D-CAT: Synthesis of 4D Coronary Artery Trees from Systole and Diastole
Figure 3 for 4D-CAT: Synthesis of 4D Coronary Artery Trees from Systole and Diastole
Figure 4 for 4D-CAT: Synthesis of 4D Coronary Artery Trees from Systole and Diastole
Viaarxiv icon

Meta-Learning Empowered Graph Neural Networks for Radio Resource Management

Add code
Aug 29, 2024
Viaarxiv icon

Beam Prediction based on Large Language Models

Add code
Aug 16, 2024
Figure 1 for Beam Prediction based on Large Language Models
Figure 2 for Beam Prediction based on Large Language Models
Figure 3 for Beam Prediction based on Large Language Models
Figure 4 for Beam Prediction based on Large Language Models
Viaarxiv icon

Cross-Modal Attention Alignment Network with Auxiliary Text Description for zero-shot sketch-based image retrieval

Add code
Jul 01, 2024
Viaarxiv icon