Cargando…

Evaluating Experimental Bias and Completeness in Comparative Phosphoproteomics Analysis

Unraveling the functional dynamics of phosphorylation networks is a crucial step in understanding the way in which biological networks form a living cell. Recently there has been an enormous increase in the number of measured phosphorylation events. Nevertheless, comparative and integrative analysis...

Descripción completa

Detalles Bibliográficos
Autores principales: Boekhorst, Jos, Boersema, Paul J., Tops, Bastiaan B. J., van Breukelen, Bas, Heck, Albert J. R., Snel, Berend
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3154393/
https://www.ncbi.nlm.nih.gov/pubmed/21853102
http://dx.doi.org/10.1371/journal.pone.0023276
_version_ 1782210014079877120
author Boekhorst, Jos
Boersema, Paul J.
Tops, Bastiaan B. J.
van Breukelen, Bas
Heck, Albert J. R.
Snel, Berend
author_facet Boekhorst, Jos
Boersema, Paul J.
Tops, Bastiaan B. J.
van Breukelen, Bas
Heck, Albert J. R.
Snel, Berend
author_sort Boekhorst, Jos
collection PubMed
description Unraveling the functional dynamics of phosphorylation networks is a crucial step in understanding the way in which biological networks form a living cell. Recently there has been an enormous increase in the number of measured phosphorylation events. Nevertheless, comparative and integrative analysis of phosphoproteomes is confounded by incomplete coverage and biases introduced by different experimental workflows. As a result, we cannot differentiate whether phosphosites indentified in only one or two samples are the result of condition or species specific phosphorylation, or reflect missing data. Here, we evaluate the impact of incomplete phosphoproteomics datasets on comparative analysis, and we present bioinformatics strategies to quantify the impact of different experimental workflows on measured phosphoproteomes. We show that plotting the saturation in observed phosphosites in replicates provides a reproducible picture of the extent of a particular phosphoproteome. Still, we are still far away from a complete picture of the total human phosphoproteome. The impact of different experimental techniques on the similarity between phosphoproteomes can be estimated by comparing datasets from different experimental pipelines to a common reference. Our results show that comparative analysis is most powerful when datasets have been generated using the same experimental workflow. We show this experimentally by measuring the tyrosine phosphoproteome from Caenorhabditis elegans and comparing it to the tyrosine phosphoproteome of HeLa cells, resulting in an overlap of about 4%. This overlap between very different organisms represents a three-fold increase when compared to dataset of older studies, wherein different workflows were used. The strategies we suggest enable an estimation of the impact of differences in experimental workflows on the overlap between datasets. This will allow us to perform comparative analyses not only on datasets specifically generated for this purpose, but also to extract insights through comparative analysis of the ever-increasing wealth of publically available phosphorylation data.
format Online
Article
Text
id pubmed-3154393
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-31543932011-08-18 Evaluating Experimental Bias and Completeness in Comparative Phosphoproteomics Analysis Boekhorst, Jos Boersema, Paul J. Tops, Bastiaan B. J. van Breukelen, Bas Heck, Albert J. R. Snel, Berend PLoS One Research Article Unraveling the functional dynamics of phosphorylation networks is a crucial step in understanding the way in which biological networks form a living cell. Recently there has been an enormous increase in the number of measured phosphorylation events. Nevertheless, comparative and integrative analysis of phosphoproteomes is confounded by incomplete coverage and biases introduced by different experimental workflows. As a result, we cannot differentiate whether phosphosites indentified in only one or two samples are the result of condition or species specific phosphorylation, or reflect missing data. Here, we evaluate the impact of incomplete phosphoproteomics datasets on comparative analysis, and we present bioinformatics strategies to quantify the impact of different experimental workflows on measured phosphoproteomes. We show that plotting the saturation in observed phosphosites in replicates provides a reproducible picture of the extent of a particular phosphoproteome. Still, we are still far away from a complete picture of the total human phosphoproteome. The impact of different experimental techniques on the similarity between phosphoproteomes can be estimated by comparing datasets from different experimental pipelines to a common reference. Our results show that comparative analysis is most powerful when datasets have been generated using the same experimental workflow. We show this experimentally by measuring the tyrosine phosphoproteome from Caenorhabditis elegans and comparing it to the tyrosine phosphoproteome of HeLa cells, resulting in an overlap of about 4%. This overlap between very different organisms represents a three-fold increase when compared to dataset of older studies, wherein different workflows were used. The strategies we suggest enable an estimation of the impact of differences in experimental workflows on the overlap between datasets. This will allow us to perform comparative analyses not only on datasets specifically generated for this purpose, but also to extract insights through comparative analysis of the ever-increasing wealth of publically available phosphorylation data. Public Library of Science 2011-08-10 /pmc/articles/PMC3154393/ /pubmed/21853102 http://dx.doi.org/10.1371/journal.pone.0023276 Text en Boekhorst et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Boekhorst, Jos
Boersema, Paul J.
Tops, Bastiaan B. J.
van Breukelen, Bas
Heck, Albert J. R.
Snel, Berend
Evaluating Experimental Bias and Completeness in Comparative Phosphoproteomics Analysis
title Evaluating Experimental Bias and Completeness in Comparative Phosphoproteomics Analysis
title_full Evaluating Experimental Bias and Completeness in Comparative Phosphoproteomics Analysis
title_fullStr Evaluating Experimental Bias and Completeness in Comparative Phosphoproteomics Analysis
title_full_unstemmed Evaluating Experimental Bias and Completeness in Comparative Phosphoproteomics Analysis
title_short Evaluating Experimental Bias and Completeness in Comparative Phosphoproteomics Analysis
title_sort evaluating experimental bias and completeness in comparative phosphoproteomics analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3154393/
https://www.ncbi.nlm.nih.gov/pubmed/21853102
http://dx.doi.org/10.1371/journal.pone.0023276
work_keys_str_mv AT boekhorstjos evaluatingexperimentalbiasandcompletenessincomparativephosphoproteomicsanalysis
AT boersemapaulj evaluatingexperimentalbiasandcompletenessincomparativephosphoproteomicsanalysis
AT topsbastiaanbj evaluatingexperimentalbiasandcompletenessincomparativephosphoproteomicsanalysis
AT vanbreukelenbas evaluatingexperimentalbiasandcompletenessincomparativephosphoproteomicsanalysis
AT heckalbertjr evaluatingexperimentalbiasandcompletenessincomparativephosphoproteomicsanalysis
AT snelberend evaluatingexperimentalbiasandcompletenessincomparativephosphoproteomicsanalysis