TY - JOUR
T1 - Limitations of clinical trial sample size estimate by subtraction of two measurements
AU - for the Alzheimer's Disease Neuroimaging Initiative
AU - Chen, Kewei
AU - Guo, Xiaojuan
AU - Pan, Rong
AU - Xiong, Chengjie
AU - Harvey, Danielle J.
AU - Chen, Yinghua
AU - Yao, Li
AU - Su, Yi
AU - Reiman, Eric M.
N1 - Publisher Copyright:
© 2021 John Wiley & Sons Ltd.
PY - 2022/3/30
Y1 - 2022/3/30
N2 - In planning randomized clinical trials (RCTs) for diseases such as Alzheimer's disease (AD), researchers frequently rely on the use of existing data obtained from only two time points to estimate sample size via the subtraction of baseline from follow-up measurements in each subject. However, the inadequacy of this method has not been reported. The aim of this study is to discuss the limitation of sample size estimation based on the subtraction of available data from only two time points for RCTs. Mathematical equations are derived to demonstrate the condition under which the obtained data pairs with variable time intervals could be used to adequately estimate sample size. The MRI-based hippocampal volume measurements from the Alzheimer's Disease Neuroimaging Initiative (ADNI) and Monte Carlo simulations (MCS) were used to illustrate the existing bias and variability of estimates. MCS results support the theoretically derived condition under which the subtraction approach may work. MCS also show the systematically under- or over-estimated sample sizes by up to 32.27 (Formula presented.) bias. Not used properly, such subtraction approach outputs the same sample size regardless of trial durations partly due to the way measurement errors are handled. Estimating sample size by subtracting two measurements should be treated with caution. Such estimates can be biased, the magnitude of which depends on the planned RCT duration. To estimate sample sizes, we recommend using more than two measurements and more comprehensive approaches such as linear mixed effect models.
AB - In planning randomized clinical trials (RCTs) for diseases such as Alzheimer's disease (AD), researchers frequently rely on the use of existing data obtained from only two time points to estimate sample size via the subtraction of baseline from follow-up measurements in each subject. However, the inadequacy of this method has not been reported. The aim of this study is to discuss the limitation of sample size estimation based on the subtraction of available data from only two time points for RCTs. Mathematical equations are derived to demonstrate the condition under which the obtained data pairs with variable time intervals could be used to adequately estimate sample size. The MRI-based hippocampal volume measurements from the Alzheimer's Disease Neuroimaging Initiative (ADNI) and Monte Carlo simulations (MCS) were used to illustrate the existing bias and variability of estimates. MCS results support the theoretically derived condition under which the subtraction approach may work. MCS also show the systematically under- or over-estimated sample sizes by up to 32.27 (Formula presented.) bias. Not used properly, such subtraction approach outputs the same sample size regardless of trial durations partly due to the way measurement errors are handled. Estimating sample size by subtracting two measurements should be treated with caution. Such estimates can be biased, the magnitude of which depends on the planned RCT duration. To estimate sample sizes, we recommend using more than two measurements and more comprehensive approaches such as linear mixed effect models.
KW - linear mixed effects model
KW - randomized clinical trial
KW - sample size estimation
KW - subtraction
KW - two time point measurement
UR - http://www.scopus.com/inward/record.url?scp=85118373415&partnerID=8YFLogxK
U2 - 10.1002/sim.9244
DO - 10.1002/sim.9244
M3 - Article
C2 - 34725853
AN - SCOPUS:85118373415
SN - 0277-6715
VL - 41
SP - 1137
EP - 1147
JO - Statistics in medicine
JF - Statistics in medicine
IS - 7
ER -