Cargando…

Profiling the Human Phosphoproteome to Estimate the True Extent of Protein Phosphorylation

[Image: see text] Public phosphorylation databases such as PhosphoSitePlus (PSP) and PeptideAtlas (PA) compile results from published papers or openly available mass spectrometry (MS) data. However, there is no database-level control for false discovery of sites, likely leading to the overestimation...

Descripción completa

Detalles Bibliográficos
Autores principales: Kalyuzhnyy, Anton, Eyers, Patrick A., Eyers, Claire E., Bowler-Barnett, Emily, Martin, Maria J., Sun, Zhi, Deutsch, Eric W., Jones, Andrew R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2022
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9171898/
https://www.ncbi.nlm.nih.gov/pubmed/35532924
http://dx.doi.org/10.1021/acs.jproteome.2c00131
_version_ 1784721771530813440
author Kalyuzhnyy, Anton
Eyers, Patrick A.
Eyers, Claire E.
Bowler-Barnett, Emily
Martin, Maria J.
Sun, Zhi
Deutsch, Eric W.
Jones, Andrew R.
author_facet Kalyuzhnyy, Anton
Eyers, Patrick A.
Eyers, Claire E.
Bowler-Barnett, Emily
Martin, Maria J.
Sun, Zhi
Deutsch, Eric W.
Jones, Andrew R.
author_sort Kalyuzhnyy, Anton
collection PubMed
description [Image: see text] Public phosphorylation databases such as PhosphoSitePlus (PSP) and PeptideAtlas (PA) compile results from published papers or openly available mass spectrometry (MS) data. However, there is no database-level control for false discovery of sites, likely leading to the overestimation of true phosphosites. By profiling the human phosphoproteome, we estimate the false discovery rate (FDR) of phosphosites and predict a more realistic count of true identifications. We rank sites into phosphorylation likelihood sets and analyze them in terms of conservation across 100 species, sequence properties, and functional annotations. We demonstrate significant differences between the sets and develop a method for independent phosphosite FDR estimation. Remarkably, we report estimated FDRs of 84, 98, and 82% within sets of phosphoserine (pSer), phosphothreonine (pThr), and phosphotyrosine (pTyr) sites, respectively, that are supported by only a single piece of identification evidence—the majority of sites in PSP. We estimate that around 62 000 Ser, 8000 Thr, and 12 000 Tyr phosphosites in the human proteome are likely to be true, which is lower than most published estimates. Furthermore, our analysis estimates that 86 000 Ser, 50 000 Thr, and 26 000 Tyr phosphosites are likely false-positive identifications, highlighting the significant potential of false-positive data to be present in phosphorylation databases.
format Online
Article
Text
id pubmed-9171898
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-91718982022-06-08 Profiling the Human Phosphoproteome to Estimate the True Extent of Protein Phosphorylation Kalyuzhnyy, Anton Eyers, Patrick A. Eyers, Claire E. Bowler-Barnett, Emily Martin, Maria J. Sun, Zhi Deutsch, Eric W. Jones, Andrew R. J Proteome Res [Image: see text] Public phosphorylation databases such as PhosphoSitePlus (PSP) and PeptideAtlas (PA) compile results from published papers or openly available mass spectrometry (MS) data. However, there is no database-level control for false discovery of sites, likely leading to the overestimation of true phosphosites. By profiling the human phosphoproteome, we estimate the false discovery rate (FDR) of phosphosites and predict a more realistic count of true identifications. We rank sites into phosphorylation likelihood sets and analyze them in terms of conservation across 100 species, sequence properties, and functional annotations. We demonstrate significant differences between the sets and develop a method for independent phosphosite FDR estimation. Remarkably, we report estimated FDRs of 84, 98, and 82% within sets of phosphoserine (pSer), phosphothreonine (pThr), and phosphotyrosine (pTyr) sites, respectively, that are supported by only a single piece of identification evidence—the majority of sites in PSP. We estimate that around 62 000 Ser, 8000 Thr, and 12 000 Tyr phosphosites in the human proteome are likely to be true, which is lower than most published estimates. Furthermore, our analysis estimates that 86 000 Ser, 50 000 Thr, and 26 000 Tyr phosphosites are likely false-positive identifications, highlighting the significant potential of false-positive data to be present in phosphorylation databases. American Chemical Society 2022-05-09 2022-06-03 /pmc/articles/PMC9171898/ /pubmed/35532924 http://dx.doi.org/10.1021/acs.jproteome.2c00131 Text en © 2022 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Kalyuzhnyy, Anton
Eyers, Patrick A.
Eyers, Claire E.
Bowler-Barnett, Emily
Martin, Maria J.
Sun, Zhi
Deutsch, Eric W.
Jones, Andrew R.
Profiling the Human Phosphoproteome to Estimate the True Extent of Protein Phosphorylation
title Profiling the Human Phosphoproteome to Estimate the True Extent of Protein Phosphorylation
title_full Profiling the Human Phosphoproteome to Estimate the True Extent of Protein Phosphorylation
title_fullStr Profiling the Human Phosphoproteome to Estimate the True Extent of Protein Phosphorylation
title_full_unstemmed Profiling the Human Phosphoproteome to Estimate the True Extent of Protein Phosphorylation
title_short Profiling the Human Phosphoproteome to Estimate the True Extent of Protein Phosphorylation
title_sort profiling the human phosphoproteome to estimate the true extent of protein phosphorylation
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9171898/
https://www.ncbi.nlm.nih.gov/pubmed/35532924
http://dx.doi.org/10.1021/acs.jproteome.2c00131
work_keys_str_mv AT kalyuzhnyyanton profilingthehumanphosphoproteometoestimatethetrueextentofproteinphosphorylation
AT eyerspatricka profilingthehumanphosphoproteometoestimatethetrueextentofproteinphosphorylation
AT eyersclairee profilingthehumanphosphoproteometoestimatethetrueextentofproteinphosphorylation
AT bowlerbarnettemily profilingthehumanphosphoproteometoestimatethetrueextentofproteinphosphorylation
AT martinmariaj profilingthehumanphosphoproteometoestimatethetrueextentofproteinphosphorylation
AT sunzhi profilingthehumanphosphoproteometoestimatethetrueextentofproteinphosphorylation
AT deutschericw profilingthehumanphosphoproteometoestimatethetrueextentofproteinphosphorylation
AT jonesandrewr profilingthehumanphosphoproteometoestimatethetrueextentofproteinphosphorylation