Predictive Auto-scaler for Kubernetes Cloud
Publication Date
1-1-2023
Document Type
Conference Proceeding
Publication Title
SysCon 2023 - 17th Annual IEEE International Systems Conference, Proceedings
DOI
10.1109/SysCon53073.2023.10131106
Abstract
One of the basic requirements with adapting to cloud technology is to find an optimal resource allocation based on the dynamic workload. The default functioning of Kubernetes Horizontal Pod Auto-scaling in cloud is scaling of its pods only when the threshold of the cluster/application is crossed in order to adapt to increasing workload. Rather we want to deploy a proactive provisioning framework based on machine learning based predictions. We have demonstrated a novel deep learning framework based on a transformer in the area of dynamic workload predictions and showed how to apply the results to a custom auto-scaler in cloud. Our Framework builds time-series predictive models in machine learning such as ARIMA, LSTM, Bi-LSTM and transformer models. The dynamic scaling framework applies machine learning algorithms and presents recommendations to make proactive and smart decisions. Though the transformer model has been used in NLP and Vision applications mostly, we showed that the transformer based model can produce the most effective results in cloud workload predictions.
Keywords
artificial intelligence, auto-scaling, containerization, prediction, re-source management, time-series analysis, time-series forecasting
Department
Applied Data Science
Recommended Citation
Simon Shim, Ankit Dhokariya, Devangi Doshi, Sarvesh Upadhye, Varun Patwari, and Ji Yong Park. "Predictive Auto-scaler for Kubernetes Cloud" SysCon 2023 - 17th Annual IEEE International Systems Conference, Proceedings (2023). https://doi.org/10.1109/SysCon53073.2023.10131106