Picture for Chaoyi Jiang

Chaoyi Jiang

Efficient LLM Inference with I/O-Aware Partial KV Cache Recomputation

Add code
Nov 26, 2024
Viaarxiv icon

CADC: Encoding User-Item Interactions for Compressing Recommendation Model Training Data

Add code
Jul 11, 2024
Viaarxiv icon