TY - JOUR
T1 - Integrating coexpression networks with GWAS to prioritize causal genes in maize
AU - Schaefer, Robert J.
AU - Michno, Jean Michel
AU - Jeffers, Joseph
AU - Hoekenga, Owen
AU - Dilkes, Brian
AU - Baxter, Ivan
AU - Myersc, Chad L.
N1 - Funding Information:
We thank Ben VanderSluis, Henry Ward, and Joanna Dinsmore for their helpful comments and feedback in writing this article. We also thank Abby Cabunoc-Mayes and other members of the Mozilla Science Lab for their mentorshipandhelpinmakingCamocoafreeandopenscientificresource. This work was supported by funding from the National Science Foundation (IOS-1126950, IOS-1444503, and IOS-1450341), the USDA Agricultural Research Service (5070-21000-039-00D), and the USDA National Institute for Food and Agriculture (2016-67012-24841).
Funding Information:
We thank Ben VanderSluis, Henry Ward, and Joanna Dinsmore for their helpful comments and feedback in writing this article. We also thank Abby Cabunoc-Mayes and other members of the Mozilla Science Lab for their mentorshipand help inmakingCamocoa freeandopen scientific resource. This workwas supported by funding from the National Science Foundation (IOS-1126950, IOS-1444503, and IOS-1450341), the USDA Agricultural Research Service (5070-21000-039-00D), and the USDA National Institute for Food and Agriculture (2016-67012-24841).
Publisher Copyright:
© 2018 ASPB.
PY - 2018/12
Y1 - 2018/12
N2 - Genome-wide association studies (GWAS) have identified loci linked to hundreds of traits in many different species. Yet, because linkage equilibrium implicates a broad region surrounding each identified locus, the causal genes often remain unknown. This problem is especially pronounced in nonhuman, nonmodel species, where functional annotations are sparse and there is frequently little information available for prioritizing candidate genes. We developed a computational approach, Camoco, that integrates loci identified by GWAS with functional information derived from gene coexpression networks. Using Camoco, we prioritized candidate genes from a large-scale GWAS examining the accumulation of 17 different elements in maize (Zea mays) seeds. Strikingly, we observed a strong dependence in the performance of our approach based on the type of coexpression network used: expression variation across genetically diverse individuals in a relevant tissue context (in our case, roots that are the primary elemental uptake and delivery system) outperformed other alternative networks. Two candidate genes identified by our approach were validated using mutants. Our study demonstrates that coexpression networks provide a powerful basis for prioritizing candidate causal genes from GWAS loci but suggests that the success of such strategies can highly depend on the gene expression data context. Both the software and the lessons on integrating GWAS data with coexpression networks generalize to species beyond maize.
AB - Genome-wide association studies (GWAS) have identified loci linked to hundreds of traits in many different species. Yet, because linkage equilibrium implicates a broad region surrounding each identified locus, the causal genes often remain unknown. This problem is especially pronounced in nonhuman, nonmodel species, where functional annotations are sparse and there is frequently little information available for prioritizing candidate genes. We developed a computational approach, Camoco, that integrates loci identified by GWAS with functional information derived from gene coexpression networks. Using Camoco, we prioritized candidate genes from a large-scale GWAS examining the accumulation of 17 different elements in maize (Zea mays) seeds. Strikingly, we observed a strong dependence in the performance of our approach based on the type of coexpression network used: expression variation across genetically diverse individuals in a relevant tissue context (in our case, roots that are the primary elemental uptake and delivery system) outperformed other alternative networks. Two candidate genes identified by our approach were validated using mutants. Our study demonstrates that coexpression networks provide a powerful basis for prioritizing candidate causal genes from GWAS loci but suggests that the success of such strategies can highly depend on the gene expression data context. Both the software and the lessons on integrating GWAS data with coexpression networks generalize to species beyond maize.
UR - http://www.scopus.com/inward/record.url?scp=85060581721&partnerID=8YFLogxK
U2 - 10.1105/tpc.18.00299
DO - 10.1105/tpc.18.00299
M3 - Article
C2 - 30413654
AN - SCOPUS:85060581721
SN - 1040-4651
VL - 30
SP - 2922
EP - 2942
JO - Plant Cell
JF - Plant Cell
IS - 12
ER -