Cargando…

Assessing Multiple Evidence Streams to Decide on Confidence for Identification of Post-Translational Modifications, within and Across Data Sets

[Image: see text] Phosphorylation is a post-translational modification of great interest to researchers due to its relevance in many biological processes. LC-MS/MS techniques have enabled high-throughput data acquisition, with studies claiming identification and localization of thousands of phosphos...

Descripción completa

Detalles Bibliográficos
Autores principales: Camacho, Oscar M., Ramsbottom, Kerry A., Collins, Andrew, Jones, Andrew R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2023
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10243108/
https://www.ncbi.nlm.nih.gov/pubmed/37099386
http://dx.doi.org/10.1021/acs.jproteome.2c00823
_version_ 1785054360667947008
author Camacho, Oscar M.
Ramsbottom, Kerry A.
Collins, Andrew
Jones, Andrew R.
author_facet Camacho, Oscar M.
Ramsbottom, Kerry A.
Collins, Andrew
Jones, Andrew R.
author_sort Camacho, Oscar M.
collection PubMed
description [Image: see text] Phosphorylation is a post-translational modification of great interest to researchers due to its relevance in many biological processes. LC-MS/MS techniques have enabled high-throughput data acquisition, with studies claiming identification and localization of thousands of phosphosites. The identification and localization of phosphosites emerge from different analytical pipelines and scoring algorithms, with uncertainty embedded throughout the pipeline. For many pipelines and algorithms, arbitrary thresholding is used, but little is known about the actual global false localization rate in these studies. Recently, it has been suggested to use decoy amino acids to estimate global false localization rates of phosphosites, among the peptide–spectrum matches reported. Here, we describe a simple pipeline aiming to maximize the information extracted from these studies by objectively collapsing from peptide–spectrum match to the peptidoform-site level, as well as combining findings from multiple studies while maintaining track of false localization rates. We show that the approach is more effective than current processes that use a simpler mechanism for handling phosphosite identification redundancy within and across studies. In our case study using eight rice phosphoproteomics data sets, 6368 unique sites were confidently identified using our decoy approach compared to 4687 using traditional thresholding in which false localization rates are unknown.
format Online
Article
Text
id pubmed-10243108
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-102431082023-06-07 Assessing Multiple Evidence Streams to Decide on Confidence for Identification of Post-Translational Modifications, within and Across Data Sets Camacho, Oscar M. Ramsbottom, Kerry A. Collins, Andrew Jones, Andrew R. J Proteome Res [Image: see text] Phosphorylation is a post-translational modification of great interest to researchers due to its relevance in many biological processes. LC-MS/MS techniques have enabled high-throughput data acquisition, with studies claiming identification and localization of thousands of phosphosites. The identification and localization of phosphosites emerge from different analytical pipelines and scoring algorithms, with uncertainty embedded throughout the pipeline. For many pipelines and algorithms, arbitrary thresholding is used, but little is known about the actual global false localization rate in these studies. Recently, it has been suggested to use decoy amino acids to estimate global false localization rates of phosphosites, among the peptide–spectrum matches reported. Here, we describe a simple pipeline aiming to maximize the information extracted from these studies by objectively collapsing from peptide–spectrum match to the peptidoform-site level, as well as combining findings from multiple studies while maintaining track of false localization rates. We show that the approach is more effective than current processes that use a simpler mechanism for handling phosphosite identification redundancy within and across studies. In our case study using eight rice phosphoproteomics data sets, 6368 unique sites were confidently identified using our decoy approach compared to 4687 using traditional thresholding in which false localization rates are unknown. American Chemical Society 2023-04-26 /pmc/articles/PMC10243108/ /pubmed/37099386 http://dx.doi.org/10.1021/acs.jproteome.2c00823 Text en © 2023 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Camacho, Oscar M.
Ramsbottom, Kerry A.
Collins, Andrew
Jones, Andrew R.
Assessing Multiple Evidence Streams to Decide on Confidence for Identification of Post-Translational Modifications, within and Across Data Sets
title Assessing Multiple Evidence Streams to Decide on Confidence for Identification of Post-Translational Modifications, within and Across Data Sets
title_full Assessing Multiple Evidence Streams to Decide on Confidence for Identification of Post-Translational Modifications, within and Across Data Sets
title_fullStr Assessing Multiple Evidence Streams to Decide on Confidence for Identification of Post-Translational Modifications, within and Across Data Sets
title_full_unstemmed Assessing Multiple Evidence Streams to Decide on Confidence for Identification of Post-Translational Modifications, within and Across Data Sets
title_short Assessing Multiple Evidence Streams to Decide on Confidence for Identification of Post-Translational Modifications, within and Across Data Sets
title_sort assessing multiple evidence streams to decide on confidence for identification of post-translational modifications, within and across data sets
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10243108/
https://www.ncbi.nlm.nih.gov/pubmed/37099386
http://dx.doi.org/10.1021/acs.jproteome.2c00823
work_keys_str_mv AT camachooscarm assessingmultipleevidencestreamstodecideonconfidenceforidentificationofposttranslationalmodificationswithinandacrossdatasets
AT ramsbottomkerrya assessingmultipleevidencestreamstodecideonconfidenceforidentificationofposttranslationalmodificationswithinandacrossdatasets
AT collinsandrew assessingmultipleevidencestreamstodecideonconfidenceforidentificationofposttranslationalmodificationswithinandacrossdatasets
AT jonesandrewr assessingmultipleevidencestreamstodecideonconfidenceforidentificationofposttranslationalmodificationswithinandacrossdatasets