Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Abdelhakim Hafid

Fully Distributed Fog Load Balancing with Multi-Agent Reinforcement Learning

May 15, 2024

Maad Ebrahim, Abdelhakim Hafid

Abstract:Real-time Internet of Things (IoT) applications require real-time support to handle the ever-growing demand for computing resources to process IoT workloads. Fog Computing provides high availability of such resources in a distributed manner. However, these resources must be efficiently managed to distribute unpredictable traffic demands among heterogeneous Fog resources. This paper proposes a fully distributed load-balancing solution with Multi-Agent Reinforcement Learning (MARL) that intelligently distributes IoT workloads to optimize the waiting time while providing fair resource utilization in the Fog network. These agents use transfer learning for life-long self-adaptation to dynamic changes in the environment. By leveraging distributed decision-making, MARL agents effectively minimize the waiting time compared to a single centralized agent solution and other baselines, enhancing end-to-end execution delay. Besides performance gain, a fully distributed solution allows for a global-scale implementation where agents can work independently in small collaboration regions, leveraging nearby local resources. Furthermore, we analyze the impact of a realistic frequency to observe the state of the environment, unlike the unrealistic common assumption in the literature of having observations readily available in real-time for every required action. The findings highlight the trade-off between realism and performance using an interval-based Gossip-based multi-casting protocol against assuming real-time observation availability for every generated workload.

* Submitted to IEEE IoTJ with 13 pages, 11 figures, and 3 tables

Via

Access Paper or Ask Questions

SLA Violation Prediction In Cloud Computing: A Machine Learning Perspective

Nov 30, 2016

Reyhane Askari Hemmat, Abdelhakim Hafid

Figure 1 for SLA Violation Prediction In Cloud Computing: A Machine Learning Perspective

Figure 2 for SLA Violation Prediction In Cloud Computing: A Machine Learning Perspective

Figure 3 for SLA Violation Prediction In Cloud Computing: A Machine Learning Perspective

Figure 4 for SLA Violation Prediction In Cloud Computing: A Machine Learning Perspective

Abstract:Service level agreement (SLA) is an essential part of cloud systems to ensure maximum availability of services for customers. With a violation of SLA, the provider has to pay penalties. In this paper, we explore two machine learning models: Naive Bayes and Random Forest Classifiers to predict SLA violations. Since SLA violations are a rare event in the real world (~0.2 %), the classification task becomes more challenging. In order to overcome these challenges, we use several re-sampling methods. We find that random forests with SMOTE-ENN re-sampling have the best performance among other methods with the accuracy of 99.88 % and F_1 score of 0.9980.

Via

Access Paper or Ask Questions