Cargando…

Estimating the total number of phosphoproteins and phosphorylation sites in eukaryotic proteomes

BACKGROUND: Phosphorylation is the most frequent post-translational modification made to proteins and may regulate protein activity as either a molecular digital switch or a rheostat. Despite the cornucopia of high-throughput (HTP) phosphoproteomic data in the last decade, it remains unclear how man...

Descripción completa

Detalles Bibliográficos
Autores principales: Vlastaridis, Panayotis, Kyriakidou, Pelagia, Chaliotis, Anargyros, Van de Peer, Yves, Oliver, Stephen G, Amoutzias, Grigoris D
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5466708/
https://www.ncbi.nlm.nih.gov/pubmed/28327990
http://dx.doi.org/10.1093/gigascience/giw015
_version_ 1783243142687883264
author Vlastaridis, Panayotis
Kyriakidou, Pelagia
Chaliotis, Anargyros
Van de Peer, Yves
Oliver, Stephen G
Amoutzias, Grigoris D
author_facet Vlastaridis, Panayotis
Kyriakidou, Pelagia
Chaliotis, Anargyros
Van de Peer, Yves
Oliver, Stephen G
Amoutzias, Grigoris D
author_sort Vlastaridis, Panayotis
collection PubMed
description BACKGROUND: Phosphorylation is the most frequent post-translational modification made to proteins and may regulate protein activity as either a molecular digital switch or a rheostat. Despite the cornucopia of high-throughput (HTP) phosphoproteomic data in the last decade, it remains unclear how many proteins are phosphorylated and how many phosphorylation sites (p-sites) can exist in total within a eukaryotic proteome. We present the first reliable estimates of the total number of phosphoproteins and p-sites for four eukaryotes (human, mouse, Arabidopsis, and yeast). RESULTS: In all, 187 HTP phosphoproteomic datasets were filtered, compiled, and studied along with two low-throughput (LTP) compendia. Estimates of the number of phosphoproteins and p-sites were inferred by two methods: Capture-Recapture, and fitting the saturation curve of cumulative redundant vs. cumulative non-redundant phosphoproteins/p-sites. Estimates were also adjusted for different levels of noise within the individual datasets and other confounding factors. We estimate that in total, 13 000, 11 000, and 3000 phosphoproteins and 230 000, 156 000, and 40 000 p-sites exist in human, mouse, and yeast, respectively, whereas estimates for Arabidopsis were not as reliable. CONCLUSIONS: Most of the phosphoproteins have been discovered for human, mouse, and yeast, while the dataset for Arabidopsis is still far from complete. The datasets for p-sites are not as close to saturation as those for phosphoproteins. Integration of the LTP data suggests that current HTP phosphoproteomics appears to be capable of capturing 70 % to 95 % of total phosphoproteins, but only 40 % to 60 % of total p-sites.
format Online
Article
Text
id pubmed-5466708
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-54667082017-06-19 Estimating the total number of phosphoproteins and phosphorylation sites in eukaryotic proteomes Vlastaridis, Panayotis Kyriakidou, Pelagia Chaliotis, Anargyros Van de Peer, Yves Oliver, Stephen G Amoutzias, Grigoris D Gigascience Research BACKGROUND: Phosphorylation is the most frequent post-translational modification made to proteins and may regulate protein activity as either a molecular digital switch or a rheostat. Despite the cornucopia of high-throughput (HTP) phosphoproteomic data in the last decade, it remains unclear how many proteins are phosphorylated and how many phosphorylation sites (p-sites) can exist in total within a eukaryotic proteome. We present the first reliable estimates of the total number of phosphoproteins and p-sites for four eukaryotes (human, mouse, Arabidopsis, and yeast). RESULTS: In all, 187 HTP phosphoproteomic datasets were filtered, compiled, and studied along with two low-throughput (LTP) compendia. Estimates of the number of phosphoproteins and p-sites were inferred by two methods: Capture-Recapture, and fitting the saturation curve of cumulative redundant vs. cumulative non-redundant phosphoproteins/p-sites. Estimates were also adjusted for different levels of noise within the individual datasets and other confounding factors. We estimate that in total, 13 000, 11 000, and 3000 phosphoproteins and 230 000, 156 000, and 40 000 p-sites exist in human, mouse, and yeast, respectively, whereas estimates for Arabidopsis were not as reliable. CONCLUSIONS: Most of the phosphoproteins have been discovered for human, mouse, and yeast, while the dataset for Arabidopsis is still far from complete. The datasets for p-sites are not as close to saturation as those for phosphoproteins. Integration of the LTP data suggests that current HTP phosphoproteomics appears to be capable of capturing 70 % to 95 % of total phosphoproteins, but only 40 % to 60 % of total p-sites. Oxford University Press 2017-01-07 /pmc/articles/PMC5466708/ /pubmed/28327990 http://dx.doi.org/10.1093/gigascience/giw015 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Vlastaridis, Panayotis
Kyriakidou, Pelagia
Chaliotis, Anargyros
Van de Peer, Yves
Oliver, Stephen G
Amoutzias, Grigoris D
Estimating the total number of phosphoproteins and phosphorylation sites in eukaryotic proteomes
title Estimating the total number of phosphoproteins and phosphorylation sites in eukaryotic proteomes
title_full Estimating the total number of phosphoproteins and phosphorylation sites in eukaryotic proteomes
title_fullStr Estimating the total number of phosphoproteins and phosphorylation sites in eukaryotic proteomes
title_full_unstemmed Estimating the total number of phosphoproteins and phosphorylation sites in eukaryotic proteomes
title_short Estimating the total number of phosphoproteins and phosphorylation sites in eukaryotic proteomes
title_sort estimating the total number of phosphoproteins and phosphorylation sites in eukaryotic proteomes
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5466708/
https://www.ncbi.nlm.nih.gov/pubmed/28327990
http://dx.doi.org/10.1093/gigascience/giw015
work_keys_str_mv AT vlastaridispanayotis estimatingthetotalnumberofphosphoproteinsandphosphorylationsitesineukaryoticproteomes
AT kyriakidoupelagia estimatingthetotalnumberofphosphoproteinsandphosphorylationsitesineukaryoticproteomes
AT chaliotisanargyros estimatingthetotalnumberofphosphoproteinsandphosphorylationsitesineukaryoticproteomes
AT vandepeeryves estimatingthetotalnumberofphosphoproteinsandphosphorylationsitesineukaryoticproteomes
AT oliverstepheng estimatingthetotalnumberofphosphoproteinsandphosphorylationsitesineukaryoticproteomes
AT amoutziasgrigorisd estimatingthetotalnumberofphosphoproteinsandphosphorylationsitesineukaryoticproteomes