TY - JOUR
T1 - Adaptive Recruitment Resource Allocation to Improve Cohort Representativeness in Participatory Biomedical Datasets
AU - Borza, Victor A.
AU - Estornell, Andrew
AU - Clayton, Ellen Wright
AU - Ho, Chien-Ju
AU - Rothman, Russell L.
AU - Vorobeychik, Yevgeniy
AU - Malin, Bradley A.
N1 - Publisher Copyright:
©2024 AMIA - All rights reserved.
PY - 2024
Y1 - 2024
N2 - Large participatory biomedical studies - studies that recruit individuals to join a dataset - are gaining popularity and investment, especially for analysis by modern AI methods. Because they purposively recruit participants, these studies are uniquely able to address a lack of historical representation, an issue that has affected many biomedical datasets. In this work, we define representativeness as the similarity to a target population distribution of a set of attributes and our goal is to mirror the U.S. population across distributions of age, gender, race, and ethnicity. Many participatory studies recruit at several institutions, so we introduce a computational approach to adaptively allocate recruitment resources among sites to improve representativeness. In simulated recruitment of 10,000-participant cohorts from medical centers in the STAR Clinical Research Network, we show that our approach yields a more representative cohort than existing baselines. Thus, we highlight the value of computational modeling in guiding recruitment efforts.
AB - Large participatory biomedical studies - studies that recruit individuals to join a dataset - are gaining popularity and investment, especially for analysis by modern AI methods. Because they purposively recruit participants, these studies are uniquely able to address a lack of historical representation, an issue that has affected many biomedical datasets. In this work, we define representativeness as the similarity to a target population distribution of a set of attributes and our goal is to mirror the U.S. population across distributions of age, gender, race, and ethnicity. Many participatory studies recruit at several institutions, so we introduce a computational approach to adaptively allocate recruitment resources among sites to improve representativeness. In simulated recruitment of 10,000-participant cohorts from medical centers in the STAR Clinical Research Network, we show that our approach yields a more representative cohort than existing baselines. Thus, we highlight the value of computational modeling in guiding recruitment efforts.
UR - http://www.scopus.com/inward/record.url?scp=105006857067&partnerID=8YFLogxK
M3 - Article
C2 - 40417546
AN - SCOPUS:105006857067
SN - 1559-4076
VL - 2024
SP - 192
EP - 201
JO - AMIA ... Annual Symposium proceedings. AMIA Symposium
JF - AMIA ... Annual Symposium proceedings. AMIA Symposium
ER -