Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Wasserstein Adversarial Transformer for Cloud Workload Prediction

Mar 12, 2022

Shivani Arbat, Vinodh Kumaran Jayakumar, Jaewoo Lee, Wei Wang, In Kee Kim

Figure 1 for Wasserstein Adversarial Transformer for Cloud Workload Prediction

Figure 2 for Wasserstein Adversarial Transformer for Cloud Workload Prediction

Figure 3 for Wasserstein Adversarial Transformer for Cloud Workload Prediction

Figure 4 for Wasserstein Adversarial Transformer for Cloud Workload Prediction

Share this with someone who'll enjoy it:

Abstract:Predictive Virtual Machine (VM) auto-scaling is a promising technique to optimize cloud applications operating costs and performance. Understanding the job arrival rate is crucial for accurately predicting future changes in cloud workloads and proactively provisioning and de-provisioning VMs for hosting the applications. However, developing a model that accurately predicts cloud workload changes is extremely challenging due to the dynamic nature of cloud workloads. Long-Short-Term-Memory (LSTM) models have been developed for cloud workload prediction. Unfortunately, the state-of-the-art LSTM model leverages recurrences to predict, which naturally adds complexity and increases the inference overhead as input sequences grow longer. To develop a cloud workload prediction model with high accuracy and low inference overhead, this work presents a novel time-series forecasting model called WGAN-gp Transformer, inspired by the Transformer network and improved Wasserstein-GANs. The proposed method adopts a Transformer network as a generator and a multi-layer perceptron as a critic. The extensive evaluations with real-world workload traces show WGAN-gp Transformer achieves 5 times faster inference time with up to 5.1 percent higher prediction accuracy against the state-of-the-art approach. We also apply WGAN-gp Transformer to auto-scaling mechanisms on Google cloud platforms, and the WGAN-gp Transformer-based auto-scaling mechanism outperforms the LSTM-based mechanism by significantly reducing VM over-provisioning and under-provisioning rates.

* The Thirty-Fourth Annual Conference on Innovative Applications of Artificial Intelligence (IAAI-22) (presented at AAAI-2022)

View paper on

Share this with someone who'll enjoy it:

Title:Wasserstein Adversarial Transformer for Cloud Workload Prediction

Paper and Code