Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Clive Cox

Desiderata for next generation of ML model serving

Oct 26, 2022

Sherif Akoush, Andrei Paleyes, Arnaud Van Looveren, Clive Cox

Figure 1 for Desiderata for next generation of ML model serving

Abstract:Inference is a significant part of ML software infrastructure. Despite the variety of inference frameworks available, the field as a whole can be considered in its early days. This paper puts forth a range of important qualities that next generation of inference platforms should be aiming for. We present our rationale for the importance of each quality, and discuss ways to achieve it in practice. An overarching design pattern is data-centricity, which enables smarter monitoring in ML system operation.

* Accepted at NeurIPS 2022 Workshop on Challenges in Deploying and Monitoring Machine Learning Systems

Via

Access Paper or Ask Questions

Serverless inferencing on Kubernetes

Jul 24, 2020

Clive Cox, Dan Sun, Ellis Tarn, Animesh Singh, Rakesh Kelkar, David Goodwin

Figure 1 for Serverless inferencing on Kubernetes

Abstract:Organisations are increasingly putting machine learning models into production at scale. The increasing popularity of serverless scale-to-zero paradigms presents an opportunity for deploying machine learning models to help mitigate infrastructure costs when many models may not be in continuous use. We will discuss the KFServing project which builds on the KNative serverless paradigm to provide a serverless machine learning inference solution that allows a consistent and simple interface for data scientists to deploy their models. We will show how it solves the challenges of autoscaling GPU based inference and discuss some of the lessons learnt from using it in production.

* 4 pages, 1 figure, presented at workshop on "Challenges in Deploying and Monitoring Machine Learning System" at ICML 2020

Via

Access Paper or Ask Questions

Monitoring and explainability of models in production

Jul 13, 2020

Janis Klaise, Arnaud Van Looveren, Clive Cox, Giovanni Vacanti, Alexandru Coca

Figure 1 for Monitoring and explainability of models in production

Figure 2 for Monitoring and explainability of models in production

Abstract:The machine learning lifecycle extends beyond the deployment stage. Monitoring deployed models is crucial for continued provision of high quality machine learning enabled services. Key areas include model performance and data monitoring, detecting outliers and data drift using statistical techniques, and providing explanations of historic predictions. We discuss the challenges to successful implementation of solutions in each of these areas with some recent examples of production ready solutions using open source tools.

* Workshop on Challenges in Deploying and Monitoring Machine Learning Systems (ICML 2020)

Via

Access Paper or Ask Questions