TY - JOUR
T1 - Ancestry adjustment improves genome-wide estimates of regional intolerance
AU - Hayeck, Tristan J.
AU - Stong, Nicholas
AU - Baugh, Evan
AU - Dhindsa, Ryan
AU - Turner, Tychele N.
AU - Malakar, Ayan
AU - Mosbruger, Timothy L.
AU - Shaw, Grace Tzun Wen
AU - Duan, Yuncheng
AU - Ionita-Laza, Iuliana
AU - Goldstein, David
AU - Allen, Andrew S.
N1 - Publisher Copyright:
© The Author(s) 2022.
PY - 2022/6/1
Y1 - 2022/6/1
N2 - Genomic regions subject to purifying selection are more likely to carry disease-causing mutations than regions not under selection. Cross species conservation is often used to identify such regions but with limited resolution to detect selection on short evolutionary timescales such as that occurring in only one species. In contrast, genetic intolerance looks for depletion of variation relative to expectation within a species, allowing species-specific features to be identified. When estimating the intolerance of noncoding sequence, methods strongly leverage variant frequency distributions. As the expected distributions depend on ancestry, if not properly controlled for, ancestral population source may obfuscate signals of selection. We demonstrate that properly incorporating ancestry in intolerance estimation greatly improved variant classification. We provide a genome-wide intolerance map that is conditional on ancestry and likely to be particularly valuable for variant prioritization.
AB - Genomic regions subject to purifying selection are more likely to carry disease-causing mutations than regions not under selection. Cross species conservation is often used to identify such regions but with limited resolution to detect selection on short evolutionary timescales such as that occurring in only one species. In contrast, genetic intolerance looks for depletion of variation relative to expectation within a species, allowing species-specific features to be identified. When estimating the intolerance of noncoding sequence, methods strongly leverage variant frequency distributions. As the expected distributions depend on ancestry, if not properly controlled for, ancestral population source may obfuscate signals of selection. We demonstrate that properly incorporating ancestry in intolerance estimation greatly improved variant classification. We provide a genome-wide intolerance map that is conditional on ancestry and likely to be particularly valuable for variant prioritization.
KW - evolution
KW - genetic epidemiology
KW - genetic intolerance
KW - intolerance to variation
KW - negative selection
KW - population genetics
KW - statistical genetics
UR - http://www.scopus.com/inward/record.url?scp=85131223358&partnerID=8YFLogxK
U2 - 10.1093/genetics/iyac050
DO - 10.1093/genetics/iyac050
M3 - Article
C2 - 35385101
AN - SCOPUS:85131223358
SN - 0016-6731
VL - 221
JO - Genetics
JF - Genetics
IS - 2
M1 - iyac050
ER -