TY - JOUR
T1 - Host load prediction using linear models
AU - Dinda, Peter A.
AU - O'Hallaron, David R.
PY - 2000
Y1 - 2000
N2 - This paper evaluates linear models for predicting the Digital Unix fivedashsecond host load average from 1 to 30 seconds into the future. A detailed statistical study of a large number of long, fine grain load traces from a variety of real machines leads to consideration of the Box–Jenkins models (AR, MA, ARMA, ARIMA), and the ARFIMA models (due to selfdashsimilarity.) We also consider a simple windoweddashmean model. The computational requirements of these models span a wide range, making some more practical than others for incorporation into an online prediction system. We rigorously evaluate the predictive power of the models by running a large number of randomized testcases on the load traces and then datadashmining their results. The main conclusions are that load is consistently predictable to a very useful degree, and that the simple, practical models such as AR are sufficient for host load prediction. We recommend AR(16) models or better for host load prediction. We implement an online host load prediction system around the AR(16) model and evaluate its overhead, finding that it uses miniscule amounts of CPU time and network bandwidth.
AB - This paper evaluates linear models for predicting the Digital Unix fivedashsecond host load average from 1 to 30 seconds into the future. A detailed statistical study of a large number of long, fine grain load traces from a variety of real machines leads to consideration of the Box–Jenkins models (AR, MA, ARMA, ARIMA), and the ARFIMA models (due to selfdashsimilarity.) We also consider a simple windoweddashmean model. The computational requirements of these models span a wide range, making some more practical than others for incorporation into an online prediction system. We rigorously evaluate the predictive power of the models by running a large number of randomized testcases on the load traces and then datadashmining their results. The main conclusions are that load is consistently predictable to a very useful degree, and that the simple, practical models such as AR are sufficient for host load prediction. We recommend AR(16) models or better for host load prediction. We implement an online host load prediction system around the AR(16) model and evaluate its overhead, finding that it uses miniscule amounts of CPU time and network bandwidth.
U2 - 10.1023/A:1019048724544
DO - 10.1023/A:1019048724544
M3 - Article
SN - 1386-7857
VL - 3
SP - 265
EP - 280
JO - Cluster Computing
JF - Cluster Computing
ER -