TY - GEN
T1 - A Low-Latency Edge-Cloud Serverless Computing Framework with a Multi-Armed Bandit Scheduler
AU - Chigu, Justin
AU - El-Mahdy, Ahmed
AU - Mokhtar, Bassem
AU - Elsabrouty, Maha
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Recently, serverless computing, particularly the Function-as-a-Service (FaaS) programming model, has become an important emerging technology for developers and cloud providers. It relieves developers from the burden of explicitly managing the computing resources and provides more accurate billing of the exact service and execution time. However, as a side effect of the service-side resource management, the system often inactivates a set of execution dockers, introducing a significant cold start time for the following invocation, resulting in unpredictable latency. Existing solutions mainly rely on improving an end-point management policy or scheduling into other same-tier endpoints and, more recently, considering a promising but simplified edge-cloud tier with available management information. The latter can mitigate latency by relying on offloading to a resource-rich cloud. In this paper, we consider extending the two-tier edge-cloud approach to not rely on any management information from the service side or account for function-dependent communication latency but to rely on a scheduler based on the multi-armed bandit (MAB) upper confidence bound (UCB) algorithm that dynamically learns from the prevailing real-time conditions to choose the best cloud platform to execute functions with minimal latency. A test bed was implemented, comprising an OpenWhisk system deployed on a local Kubernetes cluster (kind) and two commercial FaaS systems: Amazon Web Services (AWS) Lambda and Google Cloud Functions (GCF). The scheduler was tested in real-time using the serverless benchmark suite (SeBS). Our results show that the MAB UCB is superior to single-tier systems. The MAB UCB can achieve execution time within a close margin of an oracle scheduler and also can fail in some extreme cases.
AB - Recently, serverless computing, particularly the Function-as-a-Service (FaaS) programming model, has become an important emerging technology for developers and cloud providers. It relieves developers from the burden of explicitly managing the computing resources and provides more accurate billing of the exact service and execution time. However, as a side effect of the service-side resource management, the system often inactivates a set of execution dockers, introducing a significant cold start time for the following invocation, resulting in unpredictable latency. Existing solutions mainly rely on improving an end-point management policy or scheduling into other same-tier endpoints and, more recently, considering a promising but simplified edge-cloud tier with available management information. The latter can mitigate latency by relying on offloading to a resource-rich cloud. In this paper, we consider extending the two-tier edge-cloud approach to not rely on any management information from the service side or account for function-dependent communication latency but to rely on a scheduler based on the multi-armed bandit (MAB) upper confidence bound (UCB) algorithm that dynamically learns from the prevailing real-time conditions to choose the best cloud platform to execute functions with minimal latency. A test bed was implemented, comprising an OpenWhisk system deployed on a local Kubernetes cluster (kind) and two commercial FaaS systems: Amazon Web Services (AWS) Lambda and Google Cloud Functions (GCF). The scheduler was tested in real-time using the serverless benchmark suite (SeBS). Our results show that the MAB UCB is superior to single-tier systems. The MAB UCB can achieve execution time within a close margin of an oracle scheduler and also can fail in some extreme cases.
KW - edge computing
KW - Function-as-a-Service
KW - latency.
KW - scheduling
KW - serverless computing
UR - http://www.scopus.com/inward/record.url?scp=85199979286&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85199979286&partnerID=8YFLogxK
U2 - 10.1109/IWCMC61514.2024.10592558
DO - 10.1109/IWCMC61514.2024.10592558
M3 - Conference contribution
AN - SCOPUS:85199979286
T3 - 20th International Wireless Communications and Mobile Computing Conference, IWCMC 2024
SP - 1655
EP - 1660
BT - 20th International Wireless Communications and Mobile Computing Conference, IWCMC 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 20th IEEE International Wireless Communications and Mobile Computing Conference, IWCMC 2024
Y2 - 27 May 2024 through 31 May 2024
ER -