Picture for Zhuohan Gu

Zhuohan Gu

RAGServe: Fast Quality-Aware RAG Systems with Configuration Adaptation

Add code
Dec 13, 2024
Viaarxiv icon

LLMSteer: Improving Long-Context LLM Inference by Steering Attention on Reused Contexts

Add code
Nov 21, 2024
Viaarxiv icon