TY - JOUR
T1 - PILOT-PROTEIN
T2 - Identification of unmodified and modified proteins via high-resolution mass spectrometry and mixed-integer linear optimization
AU - Baliban, Richard C.
AU - Dimaggio, Peter A.
AU - Plazas-Mayorca, Mariana D.
AU - Garcia, Benjamin A.
AU - Floudas, Christodoulos A.
PY - 2012/9/7
Y1 - 2012/9/7
N2 - A novel protein identification framework, PILOT-PROTEIN, has been developed to construct a comprehensive list of all unmodified proteins that are present in a living sample. It uses the peptide identification results from the PILOT-SEQUEL algorithm to initially determine all unmodified proteins within the sample. Using a rigorous biclustering approach that groups incorrect peptide sequences with other homologous sequences, the number of false positives reported is minimized. A sequence tag procedure is then incorporated along with the untargeted PTM identification algorithm, PILOT-PTM, to determine a list of all modification types and sites for each protein. The unmodified protein identification algorithm, PILOT-PROTEIN, is compared to the methods SEQUEST, InsPecT, X!Tandem, VEMS, and ProteinProspector using both prepared protein samples and a more complex chromatin digest. The algorithm demonstrates superior protein identification accuracy with a lower false positive rate. All materials are freely available to the scientific community at http://pumpd.princeton.edu.
AB - A novel protein identification framework, PILOT-PROTEIN, has been developed to construct a comprehensive list of all unmodified proteins that are present in a living sample. It uses the peptide identification results from the PILOT-SEQUEL algorithm to initially determine all unmodified proteins within the sample. Using a rigorous biclustering approach that groups incorrect peptide sequences with other homologous sequences, the number of false positives reported is minimized. A sequence tag procedure is then incorporated along with the untargeted PTM identification algorithm, PILOT-PTM, to determine a list of all modification types and sites for each protein. The unmodified protein identification algorithm, PILOT-PROTEIN, is compared to the methods SEQUEST, InsPecT, X!Tandem, VEMS, and ProteinProspector using both prepared protein samples and a more complex chromatin digest. The algorithm demonstrates superior protein identification accuracy with a lower false positive rate. All materials are freely available to the scientific community at http://pumpd.princeton.edu.
KW - de novo/database search
KW - integer linear optimization
KW - peptide identification
KW - post-translational modifications
KW - protein identification
KW - tandem mass spectrometry
UR - http://www.scopus.com/inward/record.url?scp=84866084892&partnerID=8YFLogxK
U2 - 10.1021/pr300418j
DO - 10.1021/pr300418j
M3 - Article
C2 - 22788846
AN - SCOPUS:84866084892
SN - 1535-3893
VL - 11
SP - 4615
EP - 4629
JO - Journal of Proteome Research
JF - Journal of Proteome Research
IS - 9
ER -