Keisuke Kamahori

TeleRAG: Efficient Retrieval-Augmented Generation Inference with Lookahead Retrieval

Feb 28, 2025
Via arXiv

LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation

Feb 27, 2025
Via arXiv

Fiddler: CPU-GPU Orchestration for Fast Inference of Mixture-of-Experts Models

Feb 10, 2024
Via arXiv