TY - JOUR
T1 - Use of a metalearner to predict emergency medical services demand in an urban setting
AU - Ramgopal, Sriram
AU - Westling, Ted
AU - Siripong, Nalyn
AU - Salcido, David D.
AU - Martin-Gill, Christian
N1 - Publisher Copyright:
© 2021
PY - 2021/8
Y1 - 2021/8
N2 - Objective: To develop and internally validate a metalearner algorithm to predict the hourly rate of emergency medical services (EMS) dispatches in an urban setting. Methods: We performed an analysis of EMS data from New York City between years 2015-2019. Our outcome was hourly EMS dispatches, expressed as continuous data. Hours were split into derivation (75%) and validation (25%) datasets. Candidate variables included averages of prior rates, temporal and weather characteristics. We used a metalearner to evaluate and aggregate individual learners (generalized linear model, generalized additive model, random forest, multivariable adaptive regression splines, and extreme gradient boost). Four models were investigated: 1) temporal variables, 2) weather and temporal variables, and datasets in which weather data were lagged by 3) six and 4) twelve hours. In exploratory analyses, we constructed learners for high acuity and trauma encounters. Results: 7,364,275 EMS dispatches occurred during the 43,823-hour period. When using temporal variables, the mean absolute error (MAE) rate was 11.5 dispatches in the validation dataset. These were slightly improved following incorporation of weather variables (MAE 11.3). When using 6- and 12-hour lagged weather variables, learners demonstrated lower accuracy (MAE 11.8 in 6-hour lagged datasets; 12.2 in 12-hour lagged dataset). All models had a coefficient of determination (R2) ≥0.91. The extreme gradient boosting and random forest learners were assigned the highest coefficients. In an investigation of variable importance, hour of day and average EMS dispatches over the previous six hours were the most important variables in both the extreme gradient boosting and random forest learners. The algorithm performed well at predicting frequently occurring peaks, with greater challenges at both extremes. Learners created high-acuity and for trauma-related encounters demonstrated superior MAE, but with lower R2 in the validation cohort (MAE 6.9 and R2 0.84 for high acuity encounters; MAE 5.3 and R2 0.79 for trauma in learners using time and weather variables). Conclusion: We developed an ensemble machine learning algorithm to predict EMS dispatches in an urban setting. These models demonstrated high accuracy, with MAEs <12 per hour in all. These algorithms may carry benefit in the real-time prediction of EMS responses, allowing for improved resource utilization.
AB - Objective: To develop and internally validate a metalearner algorithm to predict the hourly rate of emergency medical services (EMS) dispatches in an urban setting. Methods: We performed an analysis of EMS data from New York City between years 2015-2019. Our outcome was hourly EMS dispatches, expressed as continuous data. Hours were split into derivation (75%) and validation (25%) datasets. Candidate variables included averages of prior rates, temporal and weather characteristics. We used a metalearner to evaluate and aggregate individual learners (generalized linear model, generalized additive model, random forest, multivariable adaptive regression splines, and extreme gradient boost). Four models were investigated: 1) temporal variables, 2) weather and temporal variables, and datasets in which weather data were lagged by 3) six and 4) twelve hours. In exploratory analyses, we constructed learners for high acuity and trauma encounters. Results: 7,364,275 EMS dispatches occurred during the 43,823-hour period. When using temporal variables, the mean absolute error (MAE) rate was 11.5 dispatches in the validation dataset. These were slightly improved following incorporation of weather variables (MAE 11.3). When using 6- and 12-hour lagged weather variables, learners demonstrated lower accuracy (MAE 11.8 in 6-hour lagged datasets; 12.2 in 12-hour lagged dataset). All models had a coefficient of determination (R2) ≥0.91. The extreme gradient boosting and random forest learners were assigned the highest coefficients. In an investigation of variable importance, hour of day and average EMS dispatches over the previous six hours were the most important variables in both the extreme gradient boosting and random forest learners. The algorithm performed well at predicting frequently occurring peaks, with greater challenges at both extremes. Learners created high-acuity and for trauma-related encounters demonstrated superior MAE, but with lower R2 in the validation cohort (MAE 6.9 and R2 0.84 for high acuity encounters; MAE 5.3 and R2 0.79 for trauma in learners using time and weather variables). Conclusion: We developed an ensemble machine learning algorithm to predict EMS dispatches in an urban setting. These models demonstrated high accuracy, with MAEs <12 per hour in all. These algorithms may carry benefit in the real-time prediction of EMS responses, allowing for improved resource utilization.
KW - EMS
KW - Prediction model
KW - System status management
UR - http://www.scopus.com/inward/record.url?scp=85107719654&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85107719654&partnerID=8YFLogxK
U2 - 10.1016/j.cmpb.2021.106201
DO - 10.1016/j.cmpb.2021.106201
M3 - Article
C2 - 34139474
AN - SCOPUS:85107719654
SN - 0169-2607
VL - 207
JO - Computer Methods and Programs in Biomedicine
JF - Computer Methods and Programs in Biomedicine
M1 - 106201
ER -