Cargando…
Score regularization for peptide identification
BACKGROUND: Peptide identification from tandem mass spectrometry (MS/MS) data is one of the most important problems in computational proteomics. This technique relies heavily on the accurate assessment of the quality of peptide-spectrum matches (PSMs). However, current MS technology and PSM scoring...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3044274/ https://www.ncbi.nlm.nih.gov/pubmed/21342549 http://dx.doi.org/10.1186/1471-2105-12-S1-S2 |
_version_ | 1782198708348125184 |
---|---|
author | He, Zengyou Zhao, Hongyu Yu, Weichuan |
author_facet | He, Zengyou Zhao, Hongyu Yu, Weichuan |
author_sort | He, Zengyou |
collection | PubMed |
description | BACKGROUND: Peptide identification from tandem mass spectrometry (MS/MS) data is one of the most important problems in computational proteomics. This technique relies heavily on the accurate assessment of the quality of peptide-spectrum matches (PSMs). However, current MS technology and PSM scoring algorithm are far from perfect, leading to the generation of incorrect peptide-spectrum pairs. Thus, it is critical to develop new post-processing techniques that can distinguish true identifications from false identifications effectively. RESULTS: In this paper, we present a consistency-based PSM re-ranking method to improve the initial identification results. This method uses one additional assumption that two peptides belonging to the same protein should be correlated to each other. We formulate an optimization problem that embraces two objectives through regularization: the smoothing consistency among scores of correlated peptides and the fitting consistency between new scores and initial scores. This optimization problem can be solved analytically. The experimental study on several real MS/MS data sets shows that this re-ranking method improves the identification performance. CONCLUSIONS: The score regularization method can be used as a general post-processing step for improving peptide identifications. Source codes and data sets are available at: http://bioinformatics.ust.hk/SRPI.rar. |
format | Text |
id | pubmed-3044274 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-30442742011-02-25 Score regularization for peptide identification He, Zengyou Zhao, Hongyu Yu, Weichuan BMC Bioinformatics Research BACKGROUND: Peptide identification from tandem mass spectrometry (MS/MS) data is one of the most important problems in computational proteomics. This technique relies heavily on the accurate assessment of the quality of peptide-spectrum matches (PSMs). However, current MS technology and PSM scoring algorithm are far from perfect, leading to the generation of incorrect peptide-spectrum pairs. Thus, it is critical to develop new post-processing techniques that can distinguish true identifications from false identifications effectively. RESULTS: In this paper, we present a consistency-based PSM re-ranking method to improve the initial identification results. This method uses one additional assumption that two peptides belonging to the same protein should be correlated to each other. We formulate an optimization problem that embraces two objectives through regularization: the smoothing consistency among scores of correlated peptides and the fitting consistency between new scores and initial scores. This optimization problem can be solved analytically. The experimental study on several real MS/MS data sets shows that this re-ranking method improves the identification performance. CONCLUSIONS: The score regularization method can be used as a general post-processing step for improving peptide identifications. Source codes and data sets are available at: http://bioinformatics.ust.hk/SRPI.rar. BioMed Central 2011-02-15 /pmc/articles/PMC3044274/ /pubmed/21342549 http://dx.doi.org/10.1186/1471-2105-12-S1-S2 Text en Copyright ©2011 He et al; licensee BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research He, Zengyou Zhao, Hongyu Yu, Weichuan Score regularization for peptide identification |
title | Score regularization for peptide identification |
title_full | Score regularization for peptide identification |
title_fullStr | Score regularization for peptide identification |
title_full_unstemmed | Score regularization for peptide identification |
title_short | Score regularization for peptide identification |
title_sort | score regularization for peptide identification |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3044274/ https://www.ncbi.nlm.nih.gov/pubmed/21342549 http://dx.doi.org/10.1186/1471-2105-12-S1-S2 |
work_keys_str_mv | AT hezengyou scoreregularizationforpeptideidentification AT zhaohongyu scoreregularizationforpeptideidentification AT yuweichuan scoreregularizationforpeptideidentification |