Cargando…

Statistical learning of peptide retention behavior in chromatographic separations: a new kernel-based approach for computational proteomics

BACKGROUND: High-throughput peptide and protein identification technologies have benefited tremendously from strategies based on tandem mass spectrometry (MS/MS) in combination with database searching algorithms. A major problem with existing methods lies within the significant number of false posit...

Descripción completa

Detalles Bibliográficos
Autores principales:	Pfeifer, Nico, Leinenbach, Andreas, Huber, Christian G, Kohlbacher, Oliver
Formato:	Texto
Lenguaje:	English
Publicado:	BioMed Central 2007
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2254445/ https://www.ncbi.nlm.nih.gov/pubmed/18053132 http://dx.doi.org/10.1186/1471-2105-8-468

_version_	1782151190271754240
author	Pfeifer, Nico Leinenbach, Andreas Huber, Christian G Kohlbacher, Oliver
author_facet	Pfeifer, Nico Leinenbach, Andreas Huber, Christian G Kohlbacher, Oliver
author_sort	Pfeifer, Nico
collection	PubMed
description	BACKGROUND: High-throughput peptide and protein identification technologies have benefited tremendously from strategies based on tandem mass spectrometry (MS/MS) in combination with database searching algorithms. A major problem with existing methods lies within the significant number of false positive and false negative annotations. So far, standard algorithms for protein identification do not use the information gained from separation processes usually involved in peptide analysis, such as retention time information, which are readily available from chromatographic separation of the sample. Identification can thus be improved by comparing measured retention times to predicted retention times. Current prediction models are derived from a set of measured test analytes but they usually require large amounts of training data. RESULTS: We introduce a new kernel function which can be applied in combination with support vector machines to a wide range of computational proteomics problems. We show the performance of this new approach by applying it to the prediction of peptide adsorption/elution behavior in strong anion-exchange solid-phase extraction (SAX-SPE) and ion-pair reversed-phase high-performance liquid chromatography (IP-RP-HPLC). Furthermore, the predicted retention times are used to improve spectrum identifications by a p-value-based filtering approach. The approach was tested on a number of different datasets and shows excellent performance while requiring only very small training sets (about 40 peptides instead of thousands). Using the retention time predictor in our retention time filter improves the fraction of correctly identified peptide mass spectra significantly. CONCLUSION: The proposed kernel function is well-suited for the prediction of chromatographic separation in computational proteomics and requires only a limited amount of training data. The performance of this new method is demonstrated by applying it to peptide retention time prediction in IP-RP-HPLC and prediction of peptide sample fractionation in SAX-SPE. Finally, we incorporate the predicted chromatographic behavior in a p-value based filter to improve peptide identifications based on liquid chromatography-tandem mass spectrometry.
format	Text
id	pubmed-2254445
institution	National Center for Biotechnology Information
language	English
publishDate	2007
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-22544452008-02-26 Statistical learning of peptide retention behavior in chromatographic separations: a new kernel-based approach for computational proteomics Pfeifer, Nico Leinenbach, Andreas Huber, Christian G Kohlbacher, Oliver BMC Bioinformatics Research Article BACKGROUND: High-throughput peptide and protein identification technologies have benefited tremendously from strategies based on tandem mass spectrometry (MS/MS) in combination with database searching algorithms. A major problem with existing methods lies within the significant number of false positive and false negative annotations. So far, standard algorithms for protein identification do not use the information gained from separation processes usually involved in peptide analysis, such as retention time information, which are readily available from chromatographic separation of the sample. Identification can thus be improved by comparing measured retention times to predicted retention times. Current prediction models are derived from a set of measured test analytes but they usually require large amounts of training data. RESULTS: We introduce a new kernel function which can be applied in combination with support vector machines to a wide range of computational proteomics problems. We show the performance of this new approach by applying it to the prediction of peptide adsorption/elution behavior in strong anion-exchange solid-phase extraction (SAX-SPE) and ion-pair reversed-phase high-performance liquid chromatography (IP-RP-HPLC). Furthermore, the predicted retention times are used to improve spectrum identifications by a p-value-based filtering approach. The approach was tested on a number of different datasets and shows excellent performance while requiring only very small training sets (about 40 peptides instead of thousands). Using the retention time predictor in our retention time filter improves the fraction of correctly identified peptide mass spectra significantly. CONCLUSION: The proposed kernel function is well-suited for the prediction of chromatographic separation in computational proteomics and requires only a limited amount of training data. The performance of this new method is demonstrated by applying it to peptide retention time prediction in IP-RP-HPLC and prediction of peptide sample fractionation in SAX-SPE. Finally, we incorporate the predicted chromatographic behavior in a p-value based filter to improve peptide identifications based on liquid chromatography-tandem mass spectrometry. BioMed Central 2007-11-30 /pmc/articles/PMC2254445/ /pubmed/18053132 http://dx.doi.org/10.1186/1471-2105-8-468 Text en Copyright © 2007 Pfeifer et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Article Pfeifer, Nico Leinenbach, Andreas Huber, Christian G Kohlbacher, Oliver Statistical learning of peptide retention behavior in chromatographic separations: a new kernel-based approach for computational proteomics
title	Statistical learning of peptide retention behavior in chromatographic separations: a new kernel-based approach for computational proteomics
title_full	Statistical learning of peptide retention behavior in chromatographic separations: a new kernel-based approach for computational proteomics
title_fullStr	Statistical learning of peptide retention behavior in chromatographic separations: a new kernel-based approach for computational proteomics
title_full_unstemmed	Statistical learning of peptide retention behavior in chromatographic separations: a new kernel-based approach for computational proteomics
title_short	Statistical learning of peptide retention behavior in chromatographic separations: a new kernel-based approach for computational proteomics
title_sort	statistical learning of peptide retention behavior in chromatographic separations: a new kernel-based approach for computational proteomics
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2254445/ https://www.ncbi.nlm.nih.gov/pubmed/18053132 http://dx.doi.org/10.1186/1471-2105-8-468
work_keys_str_mv	AT pfeifernico statisticallearningofpeptideretentionbehaviorinchromatographicseparationsanewkernelbasedapproachforcomputationalproteomics AT leinenbachandreas statisticallearningofpeptideretentionbehaviorinchromatographicseparationsanewkernelbasedapproachforcomputationalproteomics AT huberchristiang statisticallearningofpeptideretentionbehaviorinchromatographicseparationsanewkernelbasedapproachforcomputationalproteomics AT kohlbacheroliver statisticallearningofpeptideretentionbehaviorinchromatographicseparationsanewkernelbasedapproachforcomputationalproteomics

Statistical learning of peptide retention behavior in chromatographic separations: a new kernel-based approach for computational proteomics

Ejemplares similares