Cargando…

Computational refinement of post-translational modifications predicted from tandem mass spectrometry

Motivation: A post-translational modification (PTM) is a chemical modification of a protein that occurs naturally. Many of these modifications, such as phosphorylation, are known to play pivotal roles in the regulation of protein function. Henceforth, PTM perturbations have been linked to diverse di...

Descripción completa

Detalles Bibliográficos
Autores principales: Chung, Clement, Liu, Jian, Emili, Andrew, Frey, Brendan J.
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3051323/
https://www.ncbi.nlm.nih.gov/pubmed/21258065
http://dx.doi.org/10.1093/bioinformatics/btr017
_version_ 1782199470648197120
author Chung, Clement
Liu, Jian
Emili, Andrew
Frey, Brendan J.
author_facet Chung, Clement
Liu, Jian
Emili, Andrew
Frey, Brendan J.
author_sort Chung, Clement
collection PubMed
description Motivation: A post-translational modification (PTM) is a chemical modification of a protein that occurs naturally. Many of these modifications, such as phosphorylation, are known to play pivotal roles in the regulation of protein function. Henceforth, PTM perturbations have been linked to diverse diseases like Parkinson's, Alzheimer's, diabetes and cancer. To discover PTMs on a genome-wide scale, there is a recent surge of interest in analyzing tandem mass spectrometry data, and several unrestrictive (so-called ‘blind’) PTM search methods have been reported. However, these approaches are subject to noise in mass measurements and in the predicted modification site (amino acid position) within peptides, which can result in false PTM assignments. Results: To address these issues, we devised a machine learning algorithm, PTMClust, that can be applied to the output of blind PTM search methods to improve prediction quality, by suppressing noise in the data and clustering peptides with the same underlying modification to form PTM groups. We show that our technique outperforms two standard clustering algorithms on a simulated dataset. Additionally, we show that our algorithm significantly improves sensitivity and specificity when applied to the output of three different blind PTM search engines, SIMS, InsPecT and MODmap. Additionally, PTMClust markedly outperforms another PTM refinement algorithm, PTMFinder. We demonstrate that our technique is able to reduce false PTM assignments, improve overall detection coverage and facilitate novel PTM discovery, including terminus modifications. We applied our technique to a large-scale yeast MS/MS proteome profiling dataset and found numerous known and novel PTMs. Accurately identifying modifications in protein sequences is a critical first step for PTM profiling, and thus our approach may benefit routine proteomic analysis. Availability: Our algorithm is implemented in Matlab and is freely available for academic use. The software is available online from http://genes.toronto.edu. Supplementary Information: Supplementary data are available at Bioinformatics online. Contact: frey@psi.utoronto.ca
format Text
id pubmed-3051323
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-30513232011-03-10 Computational refinement of post-translational modifications predicted from tandem mass spectrometry Chung, Clement Liu, Jian Emili, Andrew Frey, Brendan J. Bioinformatics Original Papers Motivation: A post-translational modification (PTM) is a chemical modification of a protein that occurs naturally. Many of these modifications, such as phosphorylation, are known to play pivotal roles in the regulation of protein function. Henceforth, PTM perturbations have been linked to diverse diseases like Parkinson's, Alzheimer's, diabetes and cancer. To discover PTMs on a genome-wide scale, there is a recent surge of interest in analyzing tandem mass spectrometry data, and several unrestrictive (so-called ‘blind’) PTM search methods have been reported. However, these approaches are subject to noise in mass measurements and in the predicted modification site (amino acid position) within peptides, which can result in false PTM assignments. Results: To address these issues, we devised a machine learning algorithm, PTMClust, that can be applied to the output of blind PTM search methods to improve prediction quality, by suppressing noise in the data and clustering peptides with the same underlying modification to form PTM groups. We show that our technique outperforms two standard clustering algorithms on a simulated dataset. Additionally, we show that our algorithm significantly improves sensitivity and specificity when applied to the output of three different blind PTM search engines, SIMS, InsPecT and MODmap. Additionally, PTMClust markedly outperforms another PTM refinement algorithm, PTMFinder. We demonstrate that our technique is able to reduce false PTM assignments, improve overall detection coverage and facilitate novel PTM discovery, including terminus modifications. We applied our technique to a large-scale yeast MS/MS proteome profiling dataset and found numerous known and novel PTMs. Accurately identifying modifications in protein sequences is a critical first step for PTM profiling, and thus our approach may benefit routine proteomic analysis. Availability: Our algorithm is implemented in Matlab and is freely available for academic use. The software is available online from http://genes.toronto.edu. Supplementary Information: Supplementary data are available at Bioinformatics online. Contact: frey@psi.utoronto.ca Oxford University Press 2011-03-15 2011-01-22 /pmc/articles/PMC3051323/ /pubmed/21258065 http://dx.doi.org/10.1093/bioinformatics/btr017 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Chung, Clement
Liu, Jian
Emili, Andrew
Frey, Brendan J.
Computational refinement of post-translational modifications predicted from tandem mass spectrometry
title Computational refinement of post-translational modifications predicted from tandem mass spectrometry
title_full Computational refinement of post-translational modifications predicted from tandem mass spectrometry
title_fullStr Computational refinement of post-translational modifications predicted from tandem mass spectrometry
title_full_unstemmed Computational refinement of post-translational modifications predicted from tandem mass spectrometry
title_short Computational refinement of post-translational modifications predicted from tandem mass spectrometry
title_sort computational refinement of post-translational modifications predicted from tandem mass spectrometry
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3051323/
https://www.ncbi.nlm.nih.gov/pubmed/21258065
http://dx.doi.org/10.1093/bioinformatics/btr017
work_keys_str_mv AT chungclement computationalrefinementofposttranslationalmodificationspredictedfromtandemmassspectrometry
AT liujian computationalrefinementofposttranslationalmodificationspredictedfromtandemmassspectrometry
AT emiliandrew computationalrefinementofposttranslationalmodificationspredictedfromtandemmassspectrometry
AT freybrendanj computationalrefinementofposttranslationalmodificationspredictedfromtandemmassspectrometry