TY - JOUR
T1 - Enhancing Open Modification Searches via a Combined Approach Facilitated by Ursgal
AU - Schulze, Stefan
AU - Igiraneza, Aime Bienfait
AU - Kösters, Manuel
AU - Leufken, Johannes
AU - Leidel, Sebastian A.
AU - Garcia, Benjamin A.
AU - Fufezan, Christian
AU - Pohlschroder, Mechthild
N1 - Funding Information:
S.S. was funded by the German Research Foundation (DFG Postdoctoral Fellowship, 398625447). A.B.I. was supported by the Spring 2018 Pincus-Magaziner Family Undergraduate Research and Travel Fund from the College Alumni Society and the Seltzer Family Digital Media Award at the University of Pennsylvania. M.P., A.B.I., and S.S. were supported by the National Science Foundation Grant 1817518. M.K., J.L., and S.A.L. were funded by the NCCR RNA & Disease (Swiss National Science Foundation).
Publisher Copyright:
© 2021 The Authors. Published by American Chemical Society.
PY - 2021/4/2
Y1 - 2021/4/2
N2 - The identification of peptide sequences and their post-translational modifications (PTMs) is a crucial step in the analysis of bottom-up proteomics data. The recent development of open modification search (OMS) engines allows virtually all PTMs to be searched for. This not only increases the number of spectra that can be matched to peptides but also greatly advances the understanding of the biological roles of PTMs through the identification, and the thereby facilitated quantification, of peptidoforms (peptide sequences and their potential PTMs). Whereas the benefits of combining results from multiple protein database search engines have been previously established, similar approaches for OMS results have been missing so far. Here we compare and combine results from three different OMS engines, demonstrating an increase in peptide spectrum matches of 8-18%. The unification of search results furthermore allows for the combined downstream processing of search results, including the mapping to potential PTMs. Finally, we test for the ability of OMS engines to identify glycosylated peptides. The implementation of these engines in the Python framework Ursgal facilitates the straightforward application of the OMS with unified parameters and results files, thereby enabling yet unmatched high-throughput, large-scale data analysis.
AB - The identification of peptide sequences and their post-translational modifications (PTMs) is a crucial step in the analysis of bottom-up proteomics data. The recent development of open modification search (OMS) engines allows virtually all PTMs to be searched for. This not only increases the number of spectra that can be matched to peptides but also greatly advances the understanding of the biological roles of PTMs through the identification, and the thereby facilitated quantification, of peptidoforms (peptide sequences and their potential PTMs). Whereas the benefits of combining results from multiple protein database search engines have been previously established, similar approaches for OMS results have been missing so far. Here we compare and combine results from three different OMS engines, demonstrating an increase in peptide spectrum matches of 8-18%. The unification of search results furthermore allows for the combined downstream processing of search results, including the mapping to potential PTMs. Finally, we test for the ability of OMS engines to identify glycosylated peptides. The implementation of these engines in the Python framework Ursgal facilitates the straightforward application of the OMS with unified parameters and results files, thereby enabling yet unmatched high-throughput, large-scale data analysis.
KW - bioinformatics
KW - glycosylation
KW - open modification search
KW - post-translational modifications
KW - proteomics
KW - python
UR - https://www.scopus.com/pages/publications/85100637615
U2 - 10.1021/acs.jproteome.0c00799
DO - 10.1021/acs.jproteome.0c00799
M3 - Article
C2 - 33514075
AN - SCOPUS:85100637615
SN - 1535-3893
VL - 20
SP - 1986
EP - 1996
JO - Journal of Proteome Research
JF - Journal of Proteome Research
IS - 4
ER -