TY - JOUR
T1 - Avoiding the high bonferroni penalty in genome-wide association studies
AU - Gao, Xiaoyi
AU - Becker, Lewis C.
AU - Becker, Diane M.
AU - Starmer, Joshua D.
AU - Province, Michael A.
PY - 2010/1
Y1 - 2010/1
N2 - A major challenge in genome-wide association studies (GWASs) is to derive the multiple testing threshold when hypothesis tests are conducted using a large number of single nucleotide polymorphisms. Permutation tests are considered the gold standard in multiple testing adjustment in genetic association studies. However, it is computationally intensive, especially for GWASs, and can be impractical if a large number of random shuffles are used to ensure accuracy. Many researchers have developed approximation algorithms to relieve the computing burden imposed by permutation. One particularly attractive alternative to permutation is to calculate the effective number of independent tests, M eff, which has been shown to be promising in genetic association studies. In this study, we compare recently developed M eff methods and validate them by the permutation test with 10,000 random shuffles using two real GWAS data sets: an Illumina 1M BeadChip and an Affymetrix GeneChip® Human Mapping 500K Array Set. Our results show that the simpleM method produces the best approximation of the permutation threshold, and it does so in the shortest amount of time. We also show that M eff is indeed valid on a genome-wide scale in these data sets based on statistical theory and significance tests. The significance thresholds derived can provide practical guidelines for other studies using similar population samples and genotyping platforms.
AB - A major challenge in genome-wide association studies (GWASs) is to derive the multiple testing threshold when hypothesis tests are conducted using a large number of single nucleotide polymorphisms. Permutation tests are considered the gold standard in multiple testing adjustment in genetic association studies. However, it is computationally intensive, especially for GWASs, and can be impractical if a large number of random shuffles are used to ensure accuracy. Many researchers have developed approximation algorithms to relieve the computing burden imposed by permutation. One particularly attractive alternative to permutation is to calculate the effective number of independent tests, M eff, which has been shown to be promising in genetic association studies. In this study, we compare recently developed M eff methods and validate them by the permutation test with 10,000 random shuffles using two real GWAS data sets: an Illumina 1M BeadChip and an Affymetrix GeneChip® Human Mapping 500K Array Set. Our results show that the simpleM method produces the best approximation of the permutation threshold, and it does so in the shortest amount of time. We also show that M eff is indeed valid on a genome-wide scale in these data sets based on statistical theory and significance tests. The significance thresholds derived can provide practical guidelines for other studies using similar population samples and genotyping platforms.
KW - Genetic association studies
KW - Genome-wide significance
KW - Multiple testing correction
UR - http://www.scopus.com/inward/record.url?scp=75649151064&partnerID=8YFLogxK
U2 - 10.1002/gepi.20430
DO - 10.1002/gepi.20430
M3 - Article
C2 - 19434714
AN - SCOPUS:75649151064
SN - 0741-0395
VL - 34
SP - 100
EP - 105
JO - Genetic Epidemiology
JF - Genetic Epidemiology
IS - 1
ER -