Picture for Chaoyi Jiang

Chaoyi Jiang

DEL: Context-Aware Dynamic Exit Layer for Efficient Self-Speculative Decoding

Add code
Apr 08, 2025
Viaarxiv icon

Efficient LLM Inference with I/O-Aware Partial KV Cache Recomputation

Add code
Nov 26, 2024
Viaarxiv icon

CADC: Encoding User-Item Interactions for Compressing Recommendation Model Training Data

Add code
Jul 11, 2024
Viaarxiv icon