Cargando…

Prediction of peptides observable by mass spectrometry applied at the experimental set level

BACKGROUND: When proteins are subjected to proteolytic digestion and analyzed by mass spectrometry using a method such as 2D LC MS/MS, only a portion of the proteotypic peptides associated with each protein will be observed. The ability to predict which peptides can and cannot potentially be observe...

Descripción completa

Detalles Bibliográficos
Autores principales:	Sanders, William S, Bridges, Susan M, McCarthy, Fiona M, Nanduri, Bindu, Burgess, Shane C
Formato:	Texto
Lenguaje:	English
Publicado:	BioMed Central 2007
Materias:	Proceedings
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2099492/ https://www.ncbi.nlm.nih.gov/pubmed/18047723 http://dx.doi.org/10.1186/1471-2105-8-S7-S23

_version_	1782138318249525248
author	Sanders, William S Bridges, Susan M McCarthy, Fiona M Nanduri, Bindu Burgess, Shane C
author_facet	Sanders, William S Bridges, Susan M McCarthy, Fiona M Nanduri, Bindu Burgess, Shane C
author_sort	Sanders, William S
collection	PubMed
description	BACKGROUND: When proteins are subjected to proteolytic digestion and analyzed by mass spectrometry using a method such as 2D LC MS/MS, only a portion of the proteotypic peptides associated with each protein will be observed. The ability to predict which peptides can and cannot potentially be observed for a particular experimental dataset has several important applications in proteomics research including calculation of peptide coverage in terms of potentially detectable peptides, systems biology analysis of data sets, and protein quantification. RESULTS: We have developed a methodology for constructing artificial neural networks that can be used to predict which peptides are potentially observable for a given set of experimental, instrumental, and analytical conditions for 2D LC MS/MS (a.k.a Multidimensional Protein Identification Technology [MudPIT]) datasets. Neural network classifiers constructed using this procedure for two MudPIT datasets exhibit 10-fold cross validation accuracy of about 80%. We show that a classifier constructed for one dataset has poor predictive performance with the other dataset, thus demonstrating the need for dataset specific classifiers. Classification results with each dataset are used to compute informative percent amino acid coverage statistics for each protein in terms of the predicted detectable peptides in addition to the percent coverage of the complete sequence. We also demonstrate the utility of predicted peptide observability for systems analysis to help determine if proteins that were expected but not observed generate sufficient peptides for detection. CONCLUSION: Classifiers that accurately predict the likelihood of detecting proteotypic peptides by mass spectrometry provide proteomics researchers with powerful new approaches for data analysis. We demonstrate that the procedure we have developed for building a classifier based on an individual experimental data set results in classifiers with accuracy comparable to those reported in the literature based on large training sets collected from multiple experiments. Our approach allows the researcher to construct a classifier that is specific for the experimental, instrument, and analytical conditions of a single experiment and amenable to local, condition-specific, implementation. The resulting classifiers have application in a number of areas such as determination of peptide coverage for protein identification, pathway analysis, and protein quantification.
format	Text
id	pubmed-2099492
institution	National Center for Biotechnology Information
language	English
publishDate	2007
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-20994922007-12-01 Prediction of peptides observable by mass spectrometry applied at the experimental set level Sanders, William S Bridges, Susan M McCarthy, Fiona M Nanduri, Bindu Burgess, Shane C BMC Bioinformatics Proceedings BACKGROUND: When proteins are subjected to proteolytic digestion and analyzed by mass spectrometry using a method such as 2D LC MS/MS, only a portion of the proteotypic peptides associated with each protein will be observed. The ability to predict which peptides can and cannot potentially be observed for a particular experimental dataset has several important applications in proteomics research including calculation of peptide coverage in terms of potentially detectable peptides, systems biology analysis of data sets, and protein quantification. RESULTS: We have developed a methodology for constructing artificial neural networks that can be used to predict which peptides are potentially observable for a given set of experimental, instrumental, and analytical conditions for 2D LC MS/MS (a.k.a Multidimensional Protein Identification Technology [MudPIT]) datasets. Neural network classifiers constructed using this procedure for two MudPIT datasets exhibit 10-fold cross validation accuracy of about 80%. We show that a classifier constructed for one dataset has poor predictive performance with the other dataset, thus demonstrating the need for dataset specific classifiers. Classification results with each dataset are used to compute informative percent amino acid coverage statistics for each protein in terms of the predicted detectable peptides in addition to the percent coverage of the complete sequence. We also demonstrate the utility of predicted peptide observability for systems analysis to help determine if proteins that were expected but not observed generate sufficient peptides for detection. CONCLUSION: Classifiers that accurately predict the likelihood of detecting proteotypic peptides by mass spectrometry provide proteomics researchers with powerful new approaches for data analysis. We demonstrate that the procedure we have developed for building a classifier based on an individual experimental data set results in classifiers with accuracy comparable to those reported in the literature based on large training sets collected from multiple experiments. Our approach allows the researcher to construct a classifier that is specific for the experimental, instrument, and analytical conditions of a single experiment and amenable to local, condition-specific, implementation. The resulting classifiers have application in a number of areas such as determination of peptide coverage for protein identification, pathway analysis, and protein quantification. BioMed Central 2007-11-01 /pmc/articles/PMC2099492/ /pubmed/18047723 http://dx.doi.org/10.1186/1471-2105-8-S7-S23 Text en Copyright © 2007 Sanders et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Proceedings Sanders, William S Bridges, Susan M McCarthy, Fiona M Nanduri, Bindu Burgess, Shane C Prediction of peptides observable by mass spectrometry applied at the experimental set level
title	Prediction of peptides observable by mass spectrometry applied at the experimental set level
title_full	Prediction of peptides observable by mass spectrometry applied at the experimental set level
title_fullStr	Prediction of peptides observable by mass spectrometry applied at the experimental set level
title_full_unstemmed	Prediction of peptides observable by mass spectrometry applied at the experimental set level
title_short	Prediction of peptides observable by mass spectrometry applied at the experimental set level
title_sort	prediction of peptides observable by mass spectrometry applied at the experimental set level
topic	Proceedings
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2099492/ https://www.ncbi.nlm.nih.gov/pubmed/18047723 http://dx.doi.org/10.1186/1471-2105-8-S7-S23
work_keys_str_mv	AT sanderswilliams predictionofpeptidesobservablebymassspectrometryappliedattheexperimentalsetlevel AT bridgessusanm predictionofpeptidesobservablebymassspectrometryappliedattheexperimentalsetlevel AT mccarthyfionam predictionofpeptidesobservablebymassspectrometryappliedattheexperimentalsetlevel AT nanduribindu predictionofpeptidesobservablebymassspectrometryappliedattheexperimentalsetlevel AT burgessshanec predictionofpeptidesobservablebymassspectrometryappliedattheexperimentalsetlevel

Prediction of peptides observable by mass spectrometry applied at the experimental set level

Ejemplares similares