Cargando…

Score regularization for peptide identification

BACKGROUND: Peptide identification from tandem mass spectrometry (MS/MS) data is one of the most important problems in computational proteomics. This technique relies heavily on the accurate assessment of the quality of peptide-spectrum matches (PSMs). However, current MS technology and PSM scoring...

Descripción completa

Detalles Bibliográficos
Autores principales: He, Zengyou, Zhao, Hongyu, Yu, Weichuan
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3044274/
https://www.ncbi.nlm.nih.gov/pubmed/21342549
http://dx.doi.org/10.1186/1471-2105-12-S1-S2
_version_ 1782198708348125184
author He, Zengyou
Zhao, Hongyu
Yu, Weichuan
author_facet He, Zengyou
Zhao, Hongyu
Yu, Weichuan
author_sort He, Zengyou
collection PubMed
description BACKGROUND: Peptide identification from tandem mass spectrometry (MS/MS) data is one of the most important problems in computational proteomics. This technique relies heavily on the accurate assessment of the quality of peptide-spectrum matches (PSMs). However, current MS technology and PSM scoring algorithm are far from perfect, leading to the generation of incorrect peptide-spectrum pairs. Thus, it is critical to develop new post-processing techniques that can distinguish true identifications from false identifications effectively. RESULTS: In this paper, we present a consistency-based PSM re-ranking method to improve the initial identification results. This method uses one additional assumption that two peptides belonging to the same protein should be correlated to each other. We formulate an optimization problem that embraces two objectives through regularization: the smoothing consistency among scores of correlated peptides and the fitting consistency between new scores and initial scores. This optimization problem can be solved analytically. The experimental study on several real MS/MS data sets shows that this re-ranking method improves the identification performance. CONCLUSIONS: The score regularization method can be used as a general post-processing step for improving peptide identifications. Source codes and data sets are available at: http://bioinformatics.ust.hk/SRPI.rar.
format Text
id pubmed-3044274
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-30442742011-02-25 Score regularization for peptide identification He, Zengyou Zhao, Hongyu Yu, Weichuan BMC Bioinformatics Research BACKGROUND: Peptide identification from tandem mass spectrometry (MS/MS) data is one of the most important problems in computational proteomics. This technique relies heavily on the accurate assessment of the quality of peptide-spectrum matches (PSMs). However, current MS technology and PSM scoring algorithm are far from perfect, leading to the generation of incorrect peptide-spectrum pairs. Thus, it is critical to develop new post-processing techniques that can distinguish true identifications from false identifications effectively. RESULTS: In this paper, we present a consistency-based PSM re-ranking method to improve the initial identification results. This method uses one additional assumption that two peptides belonging to the same protein should be correlated to each other. We formulate an optimization problem that embraces two objectives through regularization: the smoothing consistency among scores of correlated peptides and the fitting consistency between new scores and initial scores. This optimization problem can be solved analytically. The experimental study on several real MS/MS data sets shows that this re-ranking method improves the identification performance. CONCLUSIONS: The score regularization method can be used as a general post-processing step for improving peptide identifications. Source codes and data sets are available at: http://bioinformatics.ust.hk/SRPI.rar. BioMed Central 2011-02-15 /pmc/articles/PMC3044274/ /pubmed/21342549 http://dx.doi.org/10.1186/1471-2105-12-S1-S2 Text en Copyright ©2011 He et al; licensee BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
He, Zengyou
Zhao, Hongyu
Yu, Weichuan
Score regularization for peptide identification
title Score regularization for peptide identification
title_full Score regularization for peptide identification
title_fullStr Score regularization for peptide identification
title_full_unstemmed Score regularization for peptide identification
title_short Score regularization for peptide identification
title_sort score regularization for peptide identification
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3044274/
https://www.ncbi.nlm.nih.gov/pubmed/21342549
http://dx.doi.org/10.1186/1471-2105-12-S1-S2
work_keys_str_mv AT hezengyou scoreregularizationforpeptideidentification
AT zhaohongyu scoreregularizationforpeptideidentification
AT yuweichuan scoreregularizationforpeptideidentification