TY - JOUR
T1 - An analysis of the sensitivity of proteogenomic mapping of somatic mutations and novel splicing events in cancer
AU - Ruggles, Kelly V.
AU - Tang, Zuojian
AU - Wang, Xuya
AU - Grover, Himanshu
AU - Askenazi, Manor
AU - Teubl, Jennifer
AU - Cao, Song
AU - McLellan, Michael D.
AU - Clauser, Karl R.
AU - Tabb, David L.
AU - Mertins, Philipp
AU - Slebos, Robbert
AU - Erdmann-Gilmore, Petra
AU - Li, Shunqiang
AU - Gunawardena, Harsha P.
AU - Xie, Ling
AU - Liu, Tao
AU - Zhou, Jian Ying
AU - Sun, Shisheng
AU - Hoadley, Katherine A.
AU - Perou, Charles M.
AU - Chen, Xian
AU - Davies, Sherri R.
AU - Maher, Christopher A.
AU - Kinsinger, Christopher R.
AU - Rodland, Karen D.
AU - Zhang, Hui
AU - Zhang, Zhen
AU - Ding, Li
AU - Townsend, R. Reid
AU - Rodriguez, Henry
AU - Chan, Daniel
AU - Smith, Richard D.
AU - Liebler, Daniel C.
AU - Carr, Steven A.
AU - Payne, Samuel
AU - Ellis, Matthew J.
AU - Fenyo, David
N1 - Publisher Copyright:
© 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
PY - 2016/3
Y1 - 2016/3
N2 - Improvements in mass spectrometry (MS)-based peptide sequencing provide a new opportunity to determine whether polymorphisms, mutations, and splice variants identified in cancer cells are translated. Herein, we apply a proteogenomic data integration tool (QUILTS) to illustrate protein variant discovery using whole genome, whole transcriptome, and global proteome datasets generated from a pair of luminal and basal-like breast-cancer-patient-derived xenografts (PDX). The sensitivity of proteogenomic analysis for singe nucleotide variant (SNV) expression and novel splice junction (NSJ) detection was probed using multiple MS/MS sample process replicates defined here as an independent tandem MS experiment using identical sample material. Despite analysis of over 30 sample process replicates, only about 10% of SNVs (somatic and germline) detected by both DNA and RNA sequencing were observed as peptides. An even smaller proportion of peptides corresponding to NSJ observed by RNA sequencing were detected (<0.1%). Peptides mapping to DNA-detected SNVs without a detectable mRNA transcript were also observed, suggesting that transcriptome coverage was incomplete (∼80%). In contrast to germline variants, somatic variants were less likely to be detected at the peptide level in the basal-like tumor than in the luminal tumor, raising the possibility of differential translation or protein degradation effects. In conclusion, this large-scale proteogenomic integration allowed us to determine the degree to which mutations are translated and identify gaps in sequence coverage, thereby benchmarking current technology and progress toward whole cancer proteome and transcriptome analysis.
AB - Improvements in mass spectrometry (MS)-based peptide sequencing provide a new opportunity to determine whether polymorphisms, mutations, and splice variants identified in cancer cells are translated. Herein, we apply a proteogenomic data integration tool (QUILTS) to illustrate protein variant discovery using whole genome, whole transcriptome, and global proteome datasets generated from a pair of luminal and basal-like breast-cancer-patient-derived xenografts (PDX). The sensitivity of proteogenomic analysis for singe nucleotide variant (SNV) expression and novel splice junction (NSJ) detection was probed using multiple MS/MS sample process replicates defined here as an independent tandem MS experiment using identical sample material. Despite analysis of over 30 sample process replicates, only about 10% of SNVs (somatic and germline) detected by both DNA and RNA sequencing were observed as peptides. An even smaller proportion of peptides corresponding to NSJ observed by RNA sequencing were detected (<0.1%). Peptides mapping to DNA-detected SNVs without a detectable mRNA transcript were also observed, suggesting that transcriptome coverage was incomplete (∼80%). In contrast to germline variants, somatic variants were less likely to be detected at the peptide level in the basal-like tumor than in the luminal tumor, raising the possibility of differential translation or protein degradation effects. In conclusion, this large-scale proteogenomic integration allowed us to determine the degree to which mutations are translated and identify gaps in sequence coverage, thereby benchmarking current technology and progress toward whole cancer proteome and transcriptome analysis.
UR - http://www.scopus.com/inward/record.url?scp=84962585103&partnerID=8YFLogxK
U2 - 10.1074/mcp.M115.056226
DO - 10.1074/mcp.M115.056226
M3 - Article
C2 - 26631509
AN - SCOPUS:84962585103
SN - 1535-9476
VL - 15
SP - 1060
EP - 1071
JO - Molecular and Cellular Proteomics
JF - Molecular and Cellular Proteomics
IS - 3
ER -