TY - JOUR
T1 - An analysis of the sensitivity of proteogenomic mapping of somatic mutations and novel splicing events in cancer
AU - Ruggles, Kelly V.
AU - Tang, Zuojian
AU - Wang, Xuya
AU - Grover, Himanshu
AU - Askenazi, Manor
AU - Teubl, Jennifer
AU - Cao, Song
AU - McLellan, Michael D.
AU - Clauser, Karl R.
AU - Tabb, David L.
AU - Mertins, Philipp
AU - Slebos, Robbert
AU - Erdmann-Gilmore, Petra
AU - Li, Shunqiang
AU - Gunawardena, Harsha P.
AU - Xie, Ling
AU - Liu, Tao
AU - Zhou, Jian Ying
AU - Sun, Shisheng
AU - Hoadley, Katherine A.
AU - Perou, Charles M.
AU - Chen, Xian
AU - Davies, Sherri R.
AU - Maher, Christopher A.
AU - Kinsinger, Christopher R.
AU - Rodland, Karen D.
AU - Zhang, Hui
AU - Zhang, Zhen
AU - Ding, Li
AU - Townsend, R. Reid
AU - Rodriguez, Henry
AU - Chan, Daniel
AU - Smith, Richard D.
AU - Liebler, Daniel C.
AU - Carr, Steven A.
AU - Payne, Samuel
AU - Ellis, Matthew J.
AU - Fenyo, David
N1 - Funding Information:
National Cancer Institute Clinical Proteomics Tumor Analysis Consortium (NCI CPTAC) investigators: Broad Institute of MIT and Harvard: Steven A. Carr, Michael A. Gillette, Karl R. Klauser, Eric Kuhn, D.R. Mani, Philipp Mertins; Enterprise Science and Computing, Inc.: Karen A. Ketchum; Fred Hutchinson Cancer Research Center: Amanda G. Paulovich, Jeffrey R. Whiteaker; Georgetown University: Nathan J. Edwards, Subha Madhavan, Peter B. McGarvey; Icahn School of Medicine at Mount Sinai: Pei Wang; Johns Hopkins University: Daniel Chan, Akhilesh Pandey, Ie-Ming Shih, Hui Zhang, Zhen Zhang, Heng Zhu; Leidos, Inc.: Gordon A. Whiteley; Massachusetts General Hospital and Harvard University: Steven J. Skates; Massachusetts Institute of Technology: Forest M. White; Memorial Sloan Kettering Cancer Center: Douglas A. Levine; National Cancer Institute: Emily S. Boja, Christopher R. Kinsinger, Tara Hiltke, Mehdi Mesri, Robert C. Rivers, Henry Rodriguez, Kenna M. Shaw; National Institute of Standards and Technology: Stephen E. Stein; New York University: David Fenyo; Pacific Northwest National Laboratory: Tao Liu, Jason E. McDermott, Samuel H. Payne, Karin D. Rodland, Richard D. Smith; Spectragen-Informatics: Paul Rudnick; Stanford University: Michael Snyder; University of Chicago: Yingming Zhao; University of North Carolina at Chapel Hill: Xian Chen, David F. Ransohoff; University of Washington: Andrew N. Hoofnagle; Vanderbilt University: Daniel C. Liebler, Melinda E. Sanders, Zhiao Shi, Robbert J.C. Slebos, David L. Tabb, Bing Zhang, Lisa J. Zimmerman; Virginia Tech: Yue Wang; Washington University in St. Louis: Sherri R. Davies, Li Ding, Matthew J. Ellis, R. Reid Townsend. This work was supported by National Cancer Institute (NCI) CPTAC awards U24CA159988, U24CA160019, U24CA160034, U24CA160035, U24CA160036 and by CPTAC contract 13XS068 from Leidos Biomedical Research, Inc. This work has utilized computing resources at the High Performance Computing Facility of the Center for Health Informatics and Bioinformatics at the NYU Langone Medical Center. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
Publisher Copyright:
© 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
PY - 2016/3
Y1 - 2016/3
N2 - Improvements in mass spectrometry (MS)-based peptide sequencing provide a new opportunity to determine whether polymorphisms, mutations, and splice variants identified in cancer cells are translated. Herein, we apply a proteogenomic data integration tool (QUILTS) to illustrate protein variant discovery using whole genome, whole transcriptome, and global proteome datasets generated from a pair of luminal and basal-like breast-cancer-patient-derived xenografts (PDX). The sensitivity of proteogenomic analysis for singe nucleotide variant (SNV) expression and novel splice junction (NSJ) detection was probed using multiple MS/MS sample process replicates defined here as an independent tandem MS experiment using identical sample material. Despite analysis of over 30 sample process replicates, only about 10% of SNVs (somatic and germline) detected by both DNA and RNA sequencing were observed as peptides. An even smaller proportion of peptides corresponding to NSJ observed by RNA sequencing were detected (<0.1%). Peptides mapping to DNA-detected SNVs without a detectable mRNA transcript were also observed, suggesting that transcriptome coverage was incomplete (∼80%). In contrast to germline variants, somatic variants were less likely to be detected at the peptide level in the basal-like tumor than in the luminal tumor, raising the possibility of differential translation or protein degradation effects. In conclusion, this large-scale proteogenomic integration allowed us to determine the degree to which mutations are translated and identify gaps in sequence coverage, thereby benchmarking current technology and progress toward whole cancer proteome and transcriptome analysis.
AB - Improvements in mass spectrometry (MS)-based peptide sequencing provide a new opportunity to determine whether polymorphisms, mutations, and splice variants identified in cancer cells are translated. Herein, we apply a proteogenomic data integration tool (QUILTS) to illustrate protein variant discovery using whole genome, whole transcriptome, and global proteome datasets generated from a pair of luminal and basal-like breast-cancer-patient-derived xenografts (PDX). The sensitivity of proteogenomic analysis for singe nucleotide variant (SNV) expression and novel splice junction (NSJ) detection was probed using multiple MS/MS sample process replicates defined here as an independent tandem MS experiment using identical sample material. Despite analysis of over 30 sample process replicates, only about 10% of SNVs (somatic and germline) detected by both DNA and RNA sequencing were observed as peptides. An even smaller proportion of peptides corresponding to NSJ observed by RNA sequencing were detected (<0.1%). Peptides mapping to DNA-detected SNVs without a detectable mRNA transcript were also observed, suggesting that transcriptome coverage was incomplete (∼80%). In contrast to germline variants, somatic variants were less likely to be detected at the peptide level in the basal-like tumor than in the luminal tumor, raising the possibility of differential translation or protein degradation effects. In conclusion, this large-scale proteogenomic integration allowed us to determine the degree to which mutations are translated and identify gaps in sequence coverage, thereby benchmarking current technology and progress toward whole cancer proteome and transcriptome analysis.
UR - http://www.scopus.com/inward/record.url?scp=84962585103&partnerID=8YFLogxK
U2 - 10.1074/mcp.M115.056226
DO - 10.1074/mcp.M115.056226
M3 - Article
C2 - 26631509
AN - SCOPUS:84962585103
SN - 1535-9476
VL - 15
SP - 1060
EP - 1071
JO - Molecular and Cellular Proteomics
JF - Molecular and Cellular Proteomics
IS - 3
ER -