TY - JOUR
T1 - Are spatial models advantageous for predicting county-level HIV epidemiology across the United States?
AU - Sass, Danielle
AU - Farkhad, Bita Fayaz
AU - Li, Bo
AU - Sally Chan, Man Pui
AU - Albarracín, Dolores
N1 - Publisher Copyright:
Copyright © 2021 Elsevier Ltd. All rights reserved.
PY - 2021/8/1
Y1 - 2021/8/1
N2 - Predicting human immunodeficiency virus (HIV) epidemiology is vital for achieving public health milestones. Incorporating spatial dependence when data varies by region can often provide better prediction results, at the cost of computational efficiency. However, with the growing number of covariates available that capture the data variability, the benefit of a spatial model could be less crucial. We investigate this conjecture by considering both non-spatial and spatial models for county-level HIV prediction over the US. Due to many counties with zero HIV incidences, we utilize a two-part model, with one part estimating the probability of positive HIV rates and the other estimating HIV rates of counties not classified as zero. Based on our data, the compound of logistic regression and a generalized estimating equation outperforms the candidate models in making predictions. The results suggest that considering spatial correlation for our data is not necessarily advantageous when the purpose is making predictions.
AB - Predicting human immunodeficiency virus (HIV) epidemiology is vital for achieving public health milestones. Incorporating spatial dependence when data varies by region can often provide better prediction results, at the cost of computational efficiency. However, with the growing number of covariates available that capture the data variability, the benefit of a spatial model could be less crucial. We investigate this conjecture by considering both non-spatial and spatial models for county-level HIV prediction over the US. Due to many counties with zero HIV incidences, we utilize a two-part model, with one part estimating the probability of positive HIV rates and the other estimating HIV rates of counties not classified as zero. Based on our data, the compound of logistic regression and a generalized estimating equation outperforms the candidate models in making predictions. The results suggest that considering spatial correlation for our data is not necessarily advantageous when the purpose is making predictions.
KW - Dynamic bayesian network
KW - Generalized estimating equation
KW - HIV Prediction
KW - Quantile regression
KW - Spatial autoregressive model
KW - Two-part model
UR - https://www.scopus.com/pages/publications/85113629947
U2 - 10.1016/j.sste.2021.100436
DO - 10.1016/j.sste.2021.100436
M3 - Article
C2 - 34353528
AN - SCOPUS:85113629947
SN - 1877-5845
VL - 38
SP - 100436
JO - Spatial and Spatio-temporal Epidemiology
JF - Spatial and Spatio-temporal Epidemiology
ER -