Picture for Nipun Kwatra

Nipun Kwatra

Microsoft

Accuracy is Not All You Need

Add code
Jul 12, 2024
Figure 1 for Accuracy is Not All You Need
Figure 2 for Accuracy is Not All You Need
Figure 3 for Accuracy is Not All You Need
Figure 4 for Accuracy is Not All You Need
Viaarxiv icon

Metron: Holistic Performance Evaluation Framework for LLM Inference Systems

Add code
Jul 09, 2024
Figure 1 for Metron: Holistic Performance Evaluation Framework for LLM Inference Systems
Figure 2 for Metron: Holistic Performance Evaluation Framework for LLM Inference Systems
Figure 3 for Metron: Holistic Performance Evaluation Framework for LLM Inference Systems
Figure 4 for Metron: Holistic Performance Evaluation Framework for LLM Inference Systems
Viaarxiv icon

Vidur: A Large-Scale Simulation Framework For LLM Inference

Add code
May 08, 2024
Figure 1 for Vidur: A Large-Scale Simulation Framework For LLM Inference
Figure 2 for Vidur: A Large-Scale Simulation Framework For LLM Inference
Figure 3 for Vidur: A Large-Scale Simulation Framework For LLM Inference
Figure 4 for Vidur: A Large-Scale Simulation Framework For LLM Inference
Viaarxiv icon

Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-Serve

Add code
Mar 04, 2024
Figure 1 for Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-Serve
Figure 2 for Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-Serve
Figure 3 for Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-Serve
Figure 4 for Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-Serve
Viaarxiv icon

SARATHI: Efficient LLM Inference by Piggybacking Decodes with Chunked Prefills

Add code
Aug 31, 2023
Figure 1 for SARATHI: Efficient LLM Inference by Piggybacking Decodes with Chunked Prefills
Figure 2 for SARATHI: Efficient LLM Inference by Piggybacking Decodes with Chunked Prefills
Figure 3 for SARATHI: Efficient LLM Inference by Piggybacking Decodes with Chunked Prefills
Figure 4 for SARATHI: Efficient LLM Inference by Piggybacking Decodes with Chunked Prefills
Viaarxiv icon

"Can't Take the Pressure?": Examining the Challenges of Blood Pressure Estimation via Pulse Wave Analysis

Add code
Apr 23, 2023
Figure 1 for "Can't Take the Pressure?": Examining the Challenges of Blood Pressure Estimation via Pulse Wave Analysis
Figure 2 for "Can't Take the Pressure?": Examining the Challenges of Blood Pressure Estimation via Pulse Wave Analysis
Figure 3 for "Can't Take the Pressure?": Examining the Challenges of Blood Pressure Estimation via Pulse Wave Analysis
Figure 4 for "Can't Take the Pressure?": Examining the Challenges of Blood Pressure Estimation via Pulse Wave Analysis
Viaarxiv icon

Towards Automating Retinoscopy for Refractive Error Diagnosis

Add code
Aug 10, 2022
Figure 1 for Towards Automating Retinoscopy for Refractive Error Diagnosis
Figure 2 for Towards Automating Retinoscopy for Refractive Error Diagnosis
Figure 3 for Towards Automating Retinoscopy for Refractive Error Diagnosis
Figure 4 for Towards Automating Retinoscopy for Refractive Error Diagnosis
Viaarxiv icon

Distance Learner: Incorporating Manifold Prior to Model Training

Add code
Jul 14, 2022
Figure 1 for Distance Learner: Incorporating Manifold Prior to Model Training
Figure 2 for Distance Learner: Incorporating Manifold Prior to Model Training
Figure 3 for Distance Learner: Incorporating Manifold Prior to Model Training
Figure 4 for Distance Learner: Incorporating Manifold Prior to Model Training
Viaarxiv icon

Keratoconus Classifier for Smartphone-based Corneal Topographer

Add code
May 07, 2022
Figure 1 for Keratoconus Classifier for Smartphone-based Corneal Topographer
Figure 2 for Keratoconus Classifier for Smartphone-based Corneal Topographer
Figure 3 for Keratoconus Classifier for Smartphone-based Corneal Topographer
Figure 4 for Keratoconus Classifier for Smartphone-based Corneal Topographer
Viaarxiv icon

Singularity: Planet-Scale, Preemptive and Elastic Scheduling of AI Workloads

Add code
Feb 21, 2022
Figure 1 for Singularity: Planet-Scale, Preemptive and Elastic Scheduling of AI Workloads
Figure 2 for Singularity: Planet-Scale, Preemptive and Elastic Scheduling of AI Workloads
Figure 3 for Singularity: Planet-Scale, Preemptive and Elastic Scheduling of AI Workloads
Figure 4 for Singularity: Planet-Scale, Preemptive and Elastic Scheduling of AI Workloads
Viaarxiv icon