TY - JOUR
T1 - dPQL
T2 - A lossless distributed algorithm for generalized linear mixed model with application to privacy-preserving hospital profiling
AU - Luo, Chongliang
AU - Islam, Md Nazmul
AU - Sheils, Natalie E.
AU - Buresh, John
AU - Schuemie, Martijn J.
AU - Doshi, Jalpa A.
AU - Werner, Rachel M.
AU - Asch, David A.
AU - Chen, Yong
N1 - Publisher Copyright:
© 2022 The Author(s). Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved.
PY - 2022/8/1
Y1 - 2022/8/1
N2 - Objective: To develop a lossless distributed algorithm for generalized linear mixed model (GLMM) with application to privacy-preserving hospital profiling. Materials and Methods: The GLMM is often fitted to implement hospital profiling, using clinical or administrative claims data. Due to individual patient data (IPD) privacy regulations and the computational complexity of GLMM, a distributed algorithm for hospital profiling is needed. We develop a novel distributed penalized quasi-likelihood (dPQL) algorithm to fit GLMM when only aggregated data, rather than IPD, can be shared across hospitals. We also show that the standardized mortality rates, which are often reported as the results of hospital profiling, can also be calculated distributively without sharing IPD. We demonstrate the applicability of the proposed dPQL algorithm by ranking 929 hospitals for coronavirus disease 2019 (COVID-19) mortality or referral to hospice that have been previously studied. Results: The proposed dPQL algorithm is mathematically proven to be lossless, that is, it obtains identical results as if IPD were pooled from all hospitals. In the example of hospital profiling regarding COVID-19 mortality, the dPQL algorithm reached convergence with only 5 iterations, and the estimation of fixed effects, random effects, and mortality rates were identical to that of the PQL from pooled data. Conclusion: The dPQL algorithm is lossless, privacy-preserving and fast-converging for fitting GLMM. It provides an extremely suitable and convenient distributed approach for hospital profiling.
AB - Objective: To develop a lossless distributed algorithm for generalized linear mixed model (GLMM) with application to privacy-preserving hospital profiling. Materials and Methods: The GLMM is often fitted to implement hospital profiling, using clinical or administrative claims data. Due to individual patient data (IPD) privacy regulations and the computational complexity of GLMM, a distributed algorithm for hospital profiling is needed. We develop a novel distributed penalized quasi-likelihood (dPQL) algorithm to fit GLMM when only aggregated data, rather than IPD, can be shared across hospitals. We also show that the standardized mortality rates, which are often reported as the results of hospital profiling, can also be calculated distributively without sharing IPD. We demonstrate the applicability of the proposed dPQL algorithm by ranking 929 hospitals for coronavirus disease 2019 (COVID-19) mortality or referral to hospice that have been previously studied. Results: The proposed dPQL algorithm is mathematically proven to be lossless, that is, it obtains identical results as if IPD were pooled from all hospitals. In the example of hospital profiling regarding COVID-19 mortality, the dPQL algorithm reached convergence with only 5 iterations, and the estimation of fixed effects, random effects, and mortality rates were identical to that of the PQL from pooled data. Conclusion: The dPQL algorithm is lossless, privacy-preserving and fast-converging for fitting GLMM. It provides an extremely suitable and convenient distributed approach for hospital profiling.
KW - distributed penalized quasi-likelihood algorithm
KW - federated learning
KW - generalized linear mixed model
KW - hospital profiling
KW - privacy-preserving
UR - http://www.scopus.com/inward/record.url?scp=85131894561&partnerID=8YFLogxK
U2 - 10.1093/jamia/ocac067
DO - 10.1093/jamia/ocac067
M3 - Article
C2 - 35579348
AN - SCOPUS:85131894561
SN - 1067-5027
VL - 29
SP - 1366
EP - 1371
JO - Journal of the American Medical Informatics Association
JF - Journal of the American Medical Informatics Association
IS - 8
ER -