Cargando…
Assessing Multiple Evidence Streams to Decide on Confidence for Identification of Post-Translational Modifications, within and Across Data Sets
[Image: see text] Phosphorylation is a post-translational modification of great interest to researchers due to its relevance in many biological processes. LC-MS/MS techniques have enabled high-throughput data acquisition, with studies claiming identification and localization of thousands of phosphos...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Chemical Society
2023
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10243108/ https://www.ncbi.nlm.nih.gov/pubmed/37099386 http://dx.doi.org/10.1021/acs.jproteome.2c00823 |
_version_ | 1785054360667947008 |
---|---|
author | Camacho, Oscar M. Ramsbottom, Kerry A. Collins, Andrew Jones, Andrew R. |
author_facet | Camacho, Oscar M. Ramsbottom, Kerry A. Collins, Andrew Jones, Andrew R. |
author_sort | Camacho, Oscar M. |
collection | PubMed |
description | [Image: see text] Phosphorylation is a post-translational modification of great interest to researchers due to its relevance in many biological processes. LC-MS/MS techniques have enabled high-throughput data acquisition, with studies claiming identification and localization of thousands of phosphosites. The identification and localization of phosphosites emerge from different analytical pipelines and scoring algorithms, with uncertainty embedded throughout the pipeline. For many pipelines and algorithms, arbitrary thresholding is used, but little is known about the actual global false localization rate in these studies. Recently, it has been suggested to use decoy amino acids to estimate global false localization rates of phosphosites, among the peptide–spectrum matches reported. Here, we describe a simple pipeline aiming to maximize the information extracted from these studies by objectively collapsing from peptide–spectrum match to the peptidoform-site level, as well as combining findings from multiple studies while maintaining track of false localization rates. We show that the approach is more effective than current processes that use a simpler mechanism for handling phosphosite identification redundancy within and across studies. In our case study using eight rice phosphoproteomics data sets, 6368 unique sites were confidently identified using our decoy approach compared to 4687 using traditional thresholding in which false localization rates are unknown. |
format | Online Article Text |
id | pubmed-10243108 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | American Chemical Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-102431082023-06-07 Assessing Multiple Evidence Streams to Decide on Confidence for Identification of Post-Translational Modifications, within and Across Data Sets Camacho, Oscar M. Ramsbottom, Kerry A. Collins, Andrew Jones, Andrew R. J Proteome Res [Image: see text] Phosphorylation is a post-translational modification of great interest to researchers due to its relevance in many biological processes. LC-MS/MS techniques have enabled high-throughput data acquisition, with studies claiming identification and localization of thousands of phosphosites. The identification and localization of phosphosites emerge from different analytical pipelines and scoring algorithms, with uncertainty embedded throughout the pipeline. For many pipelines and algorithms, arbitrary thresholding is used, but little is known about the actual global false localization rate in these studies. Recently, it has been suggested to use decoy amino acids to estimate global false localization rates of phosphosites, among the peptide–spectrum matches reported. Here, we describe a simple pipeline aiming to maximize the information extracted from these studies by objectively collapsing from peptide–spectrum match to the peptidoform-site level, as well as combining findings from multiple studies while maintaining track of false localization rates. We show that the approach is more effective than current processes that use a simpler mechanism for handling phosphosite identification redundancy within and across studies. In our case study using eight rice phosphoproteomics data sets, 6368 unique sites were confidently identified using our decoy approach compared to 4687 using traditional thresholding in which false localization rates are unknown. American Chemical Society 2023-04-26 /pmc/articles/PMC10243108/ /pubmed/37099386 http://dx.doi.org/10.1021/acs.jproteome.2c00823 Text en © 2023 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Camacho, Oscar M. Ramsbottom, Kerry A. Collins, Andrew Jones, Andrew R. Assessing Multiple Evidence Streams to Decide on Confidence for Identification of Post-Translational Modifications, within and Across Data Sets |
title | Assessing
Multiple Evidence Streams to Decide on Confidence
for Identification of Post-Translational Modifications, within and
Across Data Sets |
title_full | Assessing
Multiple Evidence Streams to Decide on Confidence
for Identification of Post-Translational Modifications, within and
Across Data Sets |
title_fullStr | Assessing
Multiple Evidence Streams to Decide on Confidence
for Identification of Post-Translational Modifications, within and
Across Data Sets |
title_full_unstemmed | Assessing
Multiple Evidence Streams to Decide on Confidence
for Identification of Post-Translational Modifications, within and
Across Data Sets |
title_short | Assessing
Multiple Evidence Streams to Decide on Confidence
for Identification of Post-Translational Modifications, within and
Across Data Sets |
title_sort | assessing
multiple evidence streams to decide on confidence
for identification of post-translational modifications, within and
across data sets |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10243108/ https://www.ncbi.nlm.nih.gov/pubmed/37099386 http://dx.doi.org/10.1021/acs.jproteome.2c00823 |
work_keys_str_mv | AT camachooscarm assessingmultipleevidencestreamstodecideonconfidenceforidentificationofposttranslationalmodificationswithinandacrossdatasets AT ramsbottomkerrya assessingmultipleevidencestreamstodecideonconfidenceforidentificationofposttranslationalmodificationswithinandacrossdatasets AT collinsandrew assessingmultipleevidencestreamstodecideonconfidenceforidentificationofposttranslationalmodificationswithinandacrossdatasets AT jonesandrewr assessingmultipleevidencestreamstodecideonconfidenceforidentificationofposttranslationalmodificationswithinandacrossdatasets |