TY - JOUR
T1 - Impact of three Illumina library construction methods on GC bias and HLA genotype calling
AU - Lan, James H.
AU - Yin, Yuxin
AU - Reed, Elaine F.
AU - Moua, Kevin
AU - Thomas, Kimberly
AU - Zhang, Qiuheng
N1 - Funding Information:
JHL is supported by funding from the University of British Columbia Clinician Investigator Program . The research described was supported by NIH/National Center for Advancing Translational Science (NCATS) UCLA CTSI Grant Number UL1TR000124 . The authors give thanks to One Lambda for supplying the HLA PCR primers.
Publisher Copyright:
© 2015 American Society for Histocompatibility and Immunogenetics.
PY - 2015/3/1
Y1 - 2015/3/1
N2 - Next-generation sequencing (NGS) is increasingly recognized for its ability to overcome allele ambiguity and deliver high-resolution typing in the HLA system. Using this technology, non-uniform read distribution can impede the reliability of variant detection, which renders high-confidence genotype calling particularly difficult to achieve in the polymorphic HLA complex. Recently, library construction has been implicated as the dominant factor in instigating coverage bias. To study the impact of this phenomenon on HLA genotyping, we performed long-range PCR on 12 samples to amplify HLA-A, -B, -C, -DRB1, and -DQB1, and compared the relative contribution of three Illumina library construction methods (TruSeq Nano, Nextera, Nextera XT) in generating downstream bias. Here, we show high GC% to be a good predictor of low sequencing depth. Compared to standard TruSeq Nano, GC bias was more prominent in transposase-based protocols, particularly Nextera XT, likely through a combination of transposase insertion bias being coupled with a high number of PCR enrichment cycles. Importantly, our findings demonstrate non-uniform read depth can have a direct and negative impact on the robustness of HLA genotyping, which has clinical implications for users when choosing a library construction strategy that aims to balance cost and throughput with data quality.
AB - Next-generation sequencing (NGS) is increasingly recognized for its ability to overcome allele ambiguity and deliver high-resolution typing in the HLA system. Using this technology, non-uniform read distribution can impede the reliability of variant detection, which renders high-confidence genotype calling particularly difficult to achieve in the polymorphic HLA complex. Recently, library construction has been implicated as the dominant factor in instigating coverage bias. To study the impact of this phenomenon on HLA genotyping, we performed long-range PCR on 12 samples to amplify HLA-A, -B, -C, -DRB1, and -DQB1, and compared the relative contribution of three Illumina library construction methods (TruSeq Nano, Nextera, Nextera XT) in generating downstream bias. Here, we show high GC% to be a good predictor of low sequencing depth. Compared to standard TruSeq Nano, GC bias was more prominent in transposase-based protocols, particularly Nextera XT, likely through a combination of transposase insertion bias being coupled with a high number of PCR enrichment cycles. Importantly, our findings demonstrate non-uniform read depth can have a direct and negative impact on the robustness of HLA genotyping, which has clinical implications for users when choosing a library construction strategy that aims to balance cost and throughput with data quality.
KW - GC bias
KW - HLA genotyping
KW - Illumina Nextera
KW - NGS
KW - Transposase
UR - http://www.scopus.com/inward/record.url?scp=84924085837&partnerID=8YFLogxK
U2 - 10.1016/j.humimm.2014.12.016
DO - 10.1016/j.humimm.2014.12.016
M3 - Article
C2 - 25543015
AN - SCOPUS:84924085837
SN - 0198-8859
VL - 76
SP - 166
EP - 175
JO - Human Immunology
JF - Human Immunology
IS - 2-3
ER -