TY - GEN
T1 - Fallacies and Pitfalls in Genome-Wide Association Studies
AU - Hecker, Julian
AU - Craig, Adam
AU - Hughes, Andrew
AU - Neidich, Julie
AU - Taswell, Carl
AU - Laird, Nan
N1 - Publisher Copyright:
© 2023 Brain Health Alliance.
PY - 2023
Y1 - 2023
N2 - Since the first genome-wide association study (GWAS) identifying variants associated with myocardial infarction was published over 20 years ago, GWASs have emerged as a powerful tool for exploring the genetic basis of complex traits. To date, hundreds of thousands of statistically significant associations have been reported across thousands of human phenotypes. Nevertheless, the design, implementation, and analysis of GWASs remain complex, and the results are easily misinterpreted. Common mistakes include 1) assuming that variants with the strongest statistical associations are causal instead of correlative, 2) believing that associated loci act through nearby genes, and 3) overemphasizing the contribution of individual loci to the total variability of particular traits. Clinical assays have been designed using the results of GWAS that rely on the contribution of such erroneous data interpretations to predict clinical phenotypes, reactions to medications or foods, and/or propensity to develop diseases. The failure to recognize these errors due to fallacies in logical reasoning and statistical inference presents problems for both the scientific community when the wrong targets may be prioritized in future research studies, as well as for communication with the general public when our understanding of the genetic basis of important traits may be misrepresented and overstated. Here, we review statistical data quality, analysis, and meta-analysis, of GWAS results with an emphasis on accurate and reliable interpretation. Placed in the appropriate context, GWASs enable genome-wide discovery of loci associated with diverse traits, but they constitute only a first step towards understanding the biological mechanism(s) underlying the observed associations. Scientific elucidation of these biological mechanisms must be required to establish causality with biochemical and pathophysiological explanations for any putative statistical correlations.
AB - Since the first genome-wide association study (GWAS) identifying variants associated with myocardial infarction was published over 20 years ago, GWASs have emerged as a powerful tool for exploring the genetic basis of complex traits. To date, hundreds of thousands of statistically significant associations have been reported across thousands of human phenotypes. Nevertheless, the design, implementation, and analysis of GWASs remain complex, and the results are easily misinterpreted. Common mistakes include 1) assuming that variants with the strongest statistical associations are causal instead of correlative, 2) believing that associated loci act through nearby genes, and 3) overemphasizing the contribution of individual loci to the total variability of particular traits. Clinical assays have been designed using the results of GWAS that rely on the contribution of such erroneous data interpretations to predict clinical phenotypes, reactions to medications or foods, and/or propensity to develop diseases. The failure to recognize these errors due to fallacies in logical reasoning and statistical inference presents problems for both the scientific community when the wrong targets may be prioritized in future research studies, as well as for communication with the general public when our understanding of the genetic basis of important traits may be misrepresented and overstated. Here, we review statistical data quality, analysis, and meta-analysis, of GWAS results with an emphasis on accurate and reliable interpretation. Placed in the appropriate context, GWASs enable genome-wide discovery of loci associated with diverse traits, but they constitute only a first step towards understanding the biological mechanism(s) underlying the observed associations. Scientific elucidation of these biological mechanisms must be required to establish causality with biochemical and pathophysiological explanations for any putative statistical correlations.
KW - correlation-causation fallacy
KW - family-based association studies (FBAS)
KW - fixed effects model
KW - Genome-wide association studies (GWAS)
KW - meta-analysis
KW - population stratification
KW - random effects model
UR - http://www.scopus.com/inward/record.url?scp=85190526595&partnerID=8YFLogxK
U2 - 10.48085/GFA4E8812
DO - 10.48085/GFA4E8812
M3 - Conference contribution
AN - SCOPUS:85190526595
T3 - 2023 Guardians Workshop, Guardians 2023
BT - 2023 Guardians Workshop, Guardians 2023
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2023 Guardians Workshop, Guardians 2023
Y2 - 9 October 2023
ER -