Picture for Ganesh Ananthanarayanan

Ganesh Ananthanarayanan

RAGServe: Fast Quality-Aware RAG Systems with Configuration Adaptation

Add code
Dec 13, 2024
Viaarxiv icon

Distributed AI Platform for the 6G RAN

Add code
Oct 01, 2024
Viaarxiv icon

HawkVision: Low-Latency Modeless Edge AI Serving

Add code
May 29, 2024
Viaarxiv icon

CacheGen: Fast Context Loading for Language Model Applications

Add code
Oct 11, 2023
Viaarxiv icon

OneAdapt: Fast Adaptation for Deep Learning Applications via Backpropagation

Add code
Oct 03, 2023
Viaarxiv icon

GEMEL: Model Merging for Memory-Efficient, Real-Time Video Analytics at the Edge

Add code
Jan 19, 2022
Figure 1 for GEMEL: Model Merging for Memory-Efficient, Real-Time Video Analytics at the Edge
Figure 2 for GEMEL: Model Merging for Memory-Efficient, Real-Time Video Analytics at the Edge
Figure 3 for GEMEL: Model Merging for Memory-Efficient, Real-Time Video Analytics at the Edge
Figure 4 for GEMEL: Model Merging for Memory-Efficient, Real-Time Video Analytics at the Edge
Viaarxiv icon

Ekya: Continuous Learning of Video Analytics Models on Edge Compute Servers

Add code
Dec 19, 2020
Figure 1 for Ekya: Continuous Learning of Video Analytics Models on Edge Compute Servers
Figure 2 for Ekya: Continuous Learning of Video Analytics Models on Edge Compute Servers
Figure 3 for Ekya: Continuous Learning of Video Analytics Models on Edge Compute Servers
Figure 4 for Ekya: Continuous Learning of Video Analytics Models on Edge Compute Servers
Viaarxiv icon

Visor: Privacy-Preserving Video Analytics as a Cloud Service

Add code
Jun 23, 2020
Figure 1 for Visor: Privacy-Preserving Video Analytics as a Cloud Service
Figure 2 for Visor: Privacy-Preserving Video Analytics as a Cloud Service
Figure 3 for Visor: Privacy-Preserving Video Analytics as a Cloud Service
Figure 4 for Visor: Privacy-Preserving Video Analytics as a Cloud Service
Viaarxiv icon

Machine Learning at the Network Edge: A Survey

Add code
Jul 31, 2019
Figure 1 for Machine Learning at the Network Edge: A Survey
Figure 2 for Machine Learning at the Network Edge: A Survey
Figure 3 for Machine Learning at the Network Edge: A Survey
Figure 4 for Machine Learning at the Network Edge: A Survey
Viaarxiv icon

Collage Inference: Achieving low tail latency during distributed image classification using coded redundancy models

Add code
Jun 05, 2019
Figure 1 for Collage Inference: Achieving low tail latency during distributed image classification using coded redundancy models
Figure 2 for Collage Inference: Achieving low tail latency during distributed image classification using coded redundancy models
Figure 3 for Collage Inference: Achieving low tail latency during distributed image classification using coded redundancy models
Figure 4 for Collage Inference: Achieving low tail latency during distributed image classification using coded redundancy models
Viaarxiv icon