Picture for Ravi Netravali

Ravi Netravali

Princeton University

RAGServe: Fast Quality-Aware RAG Systems with Configuration Adaptation

Add code
Dec 13, 2024
Viaarxiv icon

Marconi: Prefix Caching for the Era of Hybrid LLMs

Add code
Nov 28, 2024
Viaarxiv icon

Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving

Add code
Dec 08, 2023
Viaarxiv icon

MadEye: Boosting Live Video Analytics Accuracy with Adaptive Camera Configurations

Add code
Apr 04, 2023
Viaarxiv icon

Marvolo: Programmatic Data Augmentation for Practical ML-Driven Malware Detection

Add code
Jun 07, 2022
Figure 1 for Marvolo: Programmatic Data Augmentation for Practical ML-Driven Malware Detection
Figure 2 for Marvolo: Programmatic Data Augmentation for Practical ML-Driven Malware Detection
Figure 3 for Marvolo: Programmatic Data Augmentation for Practical ML-Driven Malware Detection
Figure 4 for Marvolo: Programmatic Data Augmentation for Practical ML-Driven Malware Detection
Viaarxiv icon

Bamboo: Making Preemptible Instances Resilient for Affordable Training of Large DNNs

Add code
Apr 26, 2022
Figure 1 for Bamboo: Making Preemptible Instances Resilient for Affordable Training of Large DNNs
Figure 2 for Bamboo: Making Preemptible Instances Resilient for Affordable Training of Large DNNs
Figure 3 for Bamboo: Making Preemptible Instances Resilient for Affordable Training of Large DNNs
Figure 4 for Bamboo: Making Preemptible Instances Resilient for Affordable Training of Large DNNs
Viaarxiv icon

GEMEL: Model Merging for Memory-Efficient, Real-Time Video Analytics at the Edge

Add code
Jan 19, 2022
Figure 1 for GEMEL: Model Merging for Memory-Efficient, Real-Time Video Analytics at the Edge
Figure 2 for GEMEL: Model Merging for Memory-Efficient, Real-Time Video Analytics at the Edge
Figure 3 for GEMEL: Model Merging for Memory-Efficient, Real-Time Video Analytics at the Edge
Figure 4 for GEMEL: Model Merging for Memory-Efficient, Real-Time Video Analytics at the Edge
Viaarxiv icon

Revelio: ML-Generated Debugging Queries for Distributed Systems

Add code
Jun 28, 2021
Figure 1 for Revelio: ML-Generated Debugging Queries for Distributed Systems
Figure 2 for Revelio: ML-Generated Debugging Queries for Distributed Systems
Figure 3 for Revelio: ML-Generated Debugging Queries for Distributed Systems
Figure 4 for Revelio: ML-Generated Debugging Queries for Distributed Systems
Viaarxiv icon

Boggart: Accelerating Retrospective Video Analytics via Model-Agnostic Ingest Processing

Add code
Jun 21, 2021
Figure 1 for Boggart: Accelerating Retrospective Video Analytics via Model-Agnostic Ingest Processing
Figure 2 for Boggart: Accelerating Retrospective Video Analytics via Model-Agnostic Ingest Processing
Figure 3 for Boggart: Accelerating Retrospective Video Analytics via Model-Agnostic Ingest Processing
Figure 4 for Boggart: Accelerating Retrospective Video Analytics via Model-Agnostic Ingest Processing
Viaarxiv icon

Dorylus: Affordable, Scalable, and Accurate GNN Training with Distributed CPU Servers and Serverless Threads

Add code
May 25, 2021
Figure 1 for Dorylus: Affordable, Scalable, and Accurate GNN Training with Distributed CPU Servers and Serverless Threads
Figure 2 for Dorylus: Affordable, Scalable, and Accurate GNN Training with Distributed CPU Servers and Serverless Threads
Figure 3 for Dorylus: Affordable, Scalable, and Accurate GNN Training with Distributed CPU Servers and Serverless Threads
Figure 4 for Dorylus: Affordable, Scalable, and Accurate GNN Training with Distributed CPU Servers and Serverless Threads
Viaarxiv icon