Picture for Lidan Shou

Lidan Shou

LATTEArena: An Evaluation Framework for LLM-powered Tabular Feature Engineering (Extended Version)

Add code
Jun 08, 2026
Viaarxiv icon

MedMemoryBench: Benchmarking Agent Memory in Personalized Healthcare

Add code
May 12, 2026
Viaarxiv icon

HybridKV: Hybrid KV Cache Compression for Efficient Multimodal Large Language Model Inference

Add code
Apr 07, 2026
Viaarxiv icon

Efficient Inference for Large Vision-Language Models: Bottlenecks, Techniques, and Prospects

Add code
Apr 07, 2026
Viaarxiv icon

See the Forest for the Trees: Loosely Speculative Decoding via Visual-Semantic Guidance for Efficient Inference of Video LLMs

Add code
Apr 07, 2026
Viaarxiv icon

SafeLoad: Efficient Admission Control Framework for Identifying Memory-Overloading Queries in Cloud Data Warehouses

Add code
Jan 05, 2026
Viaarxiv icon

FloE: On-the-Fly MoE Inference on Memory-constrained GPU

Add code
May 12, 2025
Viaarxiv icon

FloE: On-the-Fly MoE Inference

Add code
May 09, 2025
Viaarxiv icon

CHASe: Client Heterogeneity-Aware Data Selection for Effective Federated Active Learning

Add code
Apr 24, 2025
Figure 1 for CHASe: Client Heterogeneity-Aware Data Selection for Effective Federated Active Learning
Figure 2 for CHASe: Client Heterogeneity-Aware Data Selection for Effective Federated Active Learning
Figure 3 for CHASe: Client Heterogeneity-Aware Data Selection for Effective Federated Active Learning
Figure 4 for CHASe: Client Heterogeneity-Aware Data Selection for Effective Federated Active Learning
Viaarxiv icon

HMI: Hierarchical Knowledge Management for Efficient Multi-Tenant Inference in Pretrained Language Models

Add code
Apr 24, 2025
Viaarxiv icon