Lidan Shou

Efficient Inference for Large Vision-Language Models: Bottlenecks, Techniques, and Prospects

Apr 07, 2026

See the Forest for the Trees: Loosely Speculative Decoding via Visual-Semantic Guidance for Efficient Inference of Video LLMs

Apr 07, 2026

HybridKV: Hybrid KV Cache Compression for Efficient Multimodal Large Language Model Inference

Apr 07, 2026

SafeLoad: Efficient Admission Control Framework for Identifying Memory-Overloading Queries in Cloud Data Warehouses

Jan 05, 2026

FloE: On-the-Fly MoE Inference on Memory-constrained GPU

May 12, 2025

FloE: On-the-Fly MoE Inference

May 09, 2025

CHASe: Client Heterogeneity-Aware Data Selection for Effective Federated Active Learning

Apr 24, 2025

HMI: Hierarchical Knowledge Management for Efficient Multi-Tenant Inference in Pretrained Language Models

Apr 24, 2025

NLCTables: A Dataset for Marrying Natural Language Conditions with Table Discovery

Apr 22, 2025

Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models

Feb 19, 2025