TY - CHAP
T1 - Statistical analysis-meta-analysis/reproducibility
AU - Edmondson, Mackenzie J.
AU - Luo, Chongliang
AU - Chen, Yong
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2023. All rights reserved.
PY - 2023/11/4
Y1 - 2023/11/4
N2 - Federated learning has gained great popularities in the last decade for its capability of collaboratively building models on data from multiple datasets. However, in real-world biomedical settings, practical challenges remain, including the needs to protect privacy of the patients, the capability of accounting for between-site heterogeneity in patient characteristics, and, from operational point of view, the number of needed communications across data partners. In this chapter, we describe and provide examples of multi-database data-sharing mechanisms in the healthcare data context and highlight the primary methods available for performing statistical regression analysis in each setting. For each method, we discuss the advantages and disadvantages in terms of data privacy, data communication efficiency, heterogeneity awareness, and statistical accuracy. Our goal is to provide researchers with the insight necessary to choose among the available algorithms for a given setting of conducting regression analysis using multi-site data.
AB - Federated learning has gained great popularities in the last decade for its capability of collaboratively building models on data from multiple datasets. However, in real-world biomedical settings, practical challenges remain, including the needs to protect privacy of the patients, the capability of accounting for between-site heterogeneity in patient characteristics, and, from operational point of view, the number of needed communications across data partners. In this chapter, we describe and provide examples of multi-database data-sharing mechanisms in the healthcare data context and highlight the primary methods available for performing statistical regression analysis in each setting. For each method, we discuss the advantages and disadvantages in terms of data privacy, data communication efficiency, heterogeneity awareness, and statistical accuracy. Our goal is to provide researchers with the insight necessary to choose among the available algorithms for a given setting of conducting regression analysis using multi-site data.
UR - http://www.scopus.com/inward/record.url?scp=85194447001&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-36678-9_8
DO - 10.1007/978-3-031-36678-9_8
M3 - Chapter
AN - SCOPUS:85194447001
SN - 9783031366772
SP - 125
EP - 139
BT - Clinical Applications of Artificial Intelligence in Real-World Data
PB - Springer International Publishing
ER -