Comparing 22 Popular Phosphoproteomics Pipelines for Peptide Identification and Site Localization

Research output: Contribution to journal › Journal article › Research › peer-review

Phosphorylation-driven cell signaling governs most biological functions and is widely studied using mass-spectrometry-based phosphoproteomics. Identifying the peptides and localizing the phosphorylation sites within them from the raw data is challenging and can be performed by several algorithms that return scores that are not directly comparable. This increases the heterogeneity among published phosphoproteomics data sets and prevents their direct integration. Here we compare 22 pipelines implemented in the main software tools used for bottom-up phosphoproteomics analysis (MaxQuant, Proteome Discoverer, PeptideShaker). We test six search engines (Andromeda, Comet, Mascot, MS Amanda, SequestHT, and X!Tandem) in combination with several localization scoring algorithms (delta score, D-score, PTM-score, phosphoRS, and Ascore). We show that these follow very different score distributions, which can lead to different false localization rates for the same threshold. We provide a strategy to discriminate correctly from incorrectly localized phosphorylation sites in a consistent manner across the tested pipelines. The results presented here can help users choose the most appropriate pipeline and cutoffs for their phosphoproteomics analysis.
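To illustrate why one cutoff can yield different false localization rates across pipelines, the following is a minimal sketch and not the authors' method. It assumes a benchmark in which the true phosphorylation site of each peptide is known, so every localization can be labelled correct or incorrect. The function names and the toy scores are hypothetical and are not taken from the article.

```python
from typing import Optional, Sequence


def false_localization_rate(
    scores: Sequence[float], is_correct: Sequence[bool], threshold: float
) -> float:
    """Fraction of incorrect localizations among sites scoring >= threshold."""
    passed = [ok for s, ok in zip(scores, is_correct) if s >= threshold]
    if not passed:
        return 0.0  # nothing passes the cutoff, so nothing is falsely localized
    return sum(1 for ok in passed if not ok) / len(passed)


def min_threshold_for_flr(
    scores: Sequence[float], is_correct: Sequence[bool], max_flr: float
) -> Optional[float]:
    """Lowest observed score that, used as a cutoff, keeps the FLR <= max_flr."""
    for t in sorted(set(scores)):
        if false_localization_rate(scores, is_correct, t) <= max_flr:
            return t
    return None


# Hypothetical toy data: pipeline A reports probability-like scores,
# pipeline B reports scores on an unbounded scale.
scores_a = [0.99, 0.95, 0.80, 0.76, 0.60]
correct_a = [True, True, True, False, False]
scores_b = [30.0, 12.0, 5.0, 0.9, 0.8]
correct_b = [True, True, False, False, True]

# The same numeric cutoff gives a different FLR for each pipeline.
flr_a = false_localization_rate(scores_a, correct_a, 0.75)
flr_b = false_localization_rate(scores_b, correct_b, 0.75)

# Pipeline-specific cutoffs that reach the same target FLR.
cutoff_a = min_threshold_for_flr(scores_a, correct_a, 0.0)
cutoff_b = min_threshold_for_flr(scores_b, correct_b, 0.0)
```

In this toy setting, a consistent comparison across pipelines comes from fixing the target false localization rate and deriving a separate cutoff per pipeline, rather than reusing one numeric threshold everywhere.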

Original language: English
Journal: Journal of Proteome Research
Volume: 19
Issue number: 3
Pages (from-to): 1338-1345
Number of pages: 8
ISSN: 1535-3893
Publication status: Published - 2020

ID: 240408550