Cargando…

SIMSI-Transfer: Software-Assisted Reduction of Missing Values in Phosphoproteomic and Proteomic Isobaric Labeling Data Using Tandem Mass Spectrum Clustering

Isobaric stable isotope labeling techniques such as tandem mass tags (TMTs) have become popular in proteomics because they enable the relative quantification of proteins with high precision from up to 18 samples in a single experiment. While missing values in peptide quantification are rare in a sin...

Descripción completa

Detalles Bibliográficos
Autores principales: Hamood, Firas, Bayer, Florian P., Wilhelm, Mathias, Kuster, Bernhard, The, Matthew
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society for Biochemistry and Molecular Biology 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9389303/
https://www.ncbi.nlm.nih.gov/pubmed/35462064
http://dx.doi.org/10.1016/j.mcpro.2022.100238
_version_ 1784770414806827008
author Hamood, Firas
Bayer, Florian P.
Wilhelm, Mathias
Kuster, Bernhard
The, Matthew
author_facet Hamood, Firas
Bayer, Florian P.
Wilhelm, Mathias
Kuster, Bernhard
The, Matthew
author_sort Hamood, Firas
collection PubMed
description Isobaric stable isotope labeling techniques such as tandem mass tags (TMTs) have become popular in proteomics because they enable the relative quantification of proteins with high precision from up to 18 samples in a single experiment. While missing values in peptide quantification are rare in a single TMT experiment, they rapidly increase when combining multiple TMT experiments. As the field moves toward analyzing ever higher numbers of samples, tools that reduce missing values also become more important for analyzing TMT datasets. To this end, we developed SIMSI-Transfer (Similarity-based Isobaric Mass Spectra 2 [MS2] Identification Transfer), a software tool that extends our previously developed software MaRaCluster (© Matthew The) by clustering similar tandem MS2 from multiple TMT experiments. SIMSI-Transfer is based on the assumption that similarity-clustered MS2 spectra represent the same peptide. Therefore, peptide identifications made by database searching in one TMT batch can be transferred to another TMT batch in which the same peptide was fragmented but not identified. To assess the validity of this approach, we tested SIMSI-Transfer on masked search engine identification results and recovered >80% of the masked identifications while controlling errors in the transfer procedure to below 1% false discovery rate. Applying SIMSI-Transfer to six published full proteome and phosphoproteome datasets from the Clinical Proteomic Tumor Analysis Consortium led to an increase of 26 to 45% of identified MS2 spectra with TMT quantifications. This significantly decreased the number of missing values across batches and, in turn, increased the number of peptides and proteins identified in all TMT batches by 43 to 56% and 13 to 16%, respectively.
format Online
Article
Text
id pubmed-9389303
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher American Society for Biochemistry and Molecular Biology
record_format MEDLINE/PubMed
spelling pubmed-93893032022-08-22 SIMSI-Transfer: Software-Assisted Reduction of Missing Values in Phosphoproteomic and Proteomic Isobaric Labeling Data Using Tandem Mass Spectrum Clustering Hamood, Firas Bayer, Florian P. Wilhelm, Mathias Kuster, Bernhard The, Matthew Mol Cell Proteomics Technological Innovation and Resources Isobaric stable isotope labeling techniques such as tandem mass tags (TMTs) have become popular in proteomics because they enable the relative quantification of proteins with high precision from up to 18 samples in a single experiment. While missing values in peptide quantification are rare in a single TMT experiment, they rapidly increase when combining multiple TMT experiments. As the field moves toward analyzing ever higher numbers of samples, tools that reduce missing values also become more important for analyzing TMT datasets. To this end, we developed SIMSI-Transfer (Similarity-based Isobaric Mass Spectra 2 [MS2] Identification Transfer), a software tool that extends our previously developed software MaRaCluster (© Matthew The) by clustering similar tandem MS2 from multiple TMT experiments. SIMSI-Transfer is based on the assumption that similarity-clustered MS2 spectra represent the same peptide. Therefore, peptide identifications made by database searching in one TMT batch can be transferred to another TMT batch in which the same peptide was fragmented but not identified. To assess the validity of this approach, we tested SIMSI-Transfer on masked search engine identification results and recovered >80% of the masked identifications while controlling errors in the transfer procedure to below 1% false discovery rate. Applying SIMSI-Transfer to six published full proteome and phosphoproteome datasets from the Clinical Proteomic Tumor Analysis Consortium led to an increase of 26 to 45% of identified MS2 spectra with TMT quantifications. This significantly decreased the number of missing values across batches and, in turn, increased the number of peptides and proteins identified in all TMT batches by 43 to 56% and 13 to 16%, respectively. American Society for Biochemistry and Molecular Biology 2022-04-21 /pmc/articles/PMC9389303/ /pubmed/35462064 http://dx.doi.org/10.1016/j.mcpro.2022.100238 Text en © 2022 The Authors https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Technological Innovation and Resources
Hamood, Firas
Bayer, Florian P.
Wilhelm, Mathias
Kuster, Bernhard
The, Matthew
SIMSI-Transfer: Software-Assisted Reduction of Missing Values in Phosphoproteomic and Proteomic Isobaric Labeling Data Using Tandem Mass Spectrum Clustering
title SIMSI-Transfer: Software-Assisted Reduction of Missing Values in Phosphoproteomic and Proteomic Isobaric Labeling Data Using Tandem Mass Spectrum Clustering
title_full SIMSI-Transfer: Software-Assisted Reduction of Missing Values in Phosphoproteomic and Proteomic Isobaric Labeling Data Using Tandem Mass Spectrum Clustering
title_fullStr SIMSI-Transfer: Software-Assisted Reduction of Missing Values in Phosphoproteomic and Proteomic Isobaric Labeling Data Using Tandem Mass Spectrum Clustering
title_full_unstemmed SIMSI-Transfer: Software-Assisted Reduction of Missing Values in Phosphoproteomic and Proteomic Isobaric Labeling Data Using Tandem Mass Spectrum Clustering
title_short SIMSI-Transfer: Software-Assisted Reduction of Missing Values in Phosphoproteomic and Proteomic Isobaric Labeling Data Using Tandem Mass Spectrum Clustering
title_sort simsi-transfer: software-assisted reduction of missing values in phosphoproteomic and proteomic isobaric labeling data using tandem mass spectrum clustering
topic Technological Innovation and Resources
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9389303/
https://www.ncbi.nlm.nih.gov/pubmed/35462064
http://dx.doi.org/10.1016/j.mcpro.2022.100238
work_keys_str_mv AT hamoodfiras simsitransfersoftwareassistedreductionofmissingvaluesinphosphoproteomicandproteomicisobariclabelingdatausingtandemmassspectrumclustering
AT bayerflorianp simsitransfersoftwareassistedreductionofmissingvaluesinphosphoproteomicandproteomicisobariclabelingdatausingtandemmassspectrumclustering
AT wilhelmmathias simsitransfersoftwareassistedreductionofmissingvaluesinphosphoproteomicandproteomicisobariclabelingdatausingtandemmassspectrumclustering
AT kusterbernhard simsitransfersoftwareassistedreductionofmissingvaluesinphosphoproteomicandproteomicisobariclabelingdatausingtandemmassspectrumclustering
AT thematthew simsitransfersoftwareassistedreductionofmissingvaluesinphosphoproteomicandproteomicisobariclabelingdatausingtandemmassspectrumclustering