Cargando…

Open MS/MS spectral library search to identify unanticipated post-translational modifications and increase spectral identification rate

Motivation: Identification of post-translationally modified proteins has become one of the central issues of current proteomics. Spectral library search is a new and promising computational approach to mass spectrometry-based protein identification. However, its potential in identification of unanti...

Descripción completa

Detalles Bibliográficos
Autores principales: Ye, Ding, Fu, Yan, Sun, Rui-Xiang, Wang, Hai-Peng, Yuan, Zuo-Fei, Chi, Hao, He, Si-Min
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2881370/
https://www.ncbi.nlm.nih.gov/pubmed/20529934
http://dx.doi.org/10.1093/bioinformatics/btq185
_version_ 1782182108452618240
author Ye, Ding
Fu, Yan
Sun, Rui-Xiang
Wang, Hai-Peng
Yuan, Zuo-Fei
Chi, Hao
He, Si-Min
author_facet Ye, Ding
Fu, Yan
Sun, Rui-Xiang
Wang, Hai-Peng
Yuan, Zuo-Fei
Chi, Hao
He, Si-Min
author_sort Ye, Ding
collection PubMed
description Motivation: Identification of post-translationally modified proteins has become one of the central issues of current proteomics. Spectral library search is a new and promising computational approach to mass spectrometry-based protein identification. However, its potential in identification of unanticipated post-translational modifications has rarely been explored. The existing spectral library search tools are designed to match the query spectrum to the reference library spectra with the same peptide mass. Thus, spectra of peptides with unanticipated modifications cannot be identified. Results: In this article, we present an open spectral library search tool, named pMatch. It extends the existing library search algorithms in at least three aspects to support the identification of unanticipated modifications. First, the spectra in library are optimized with the full peptide sequence information to better tolerate the peptide fragmentation pattern variations caused by some modification(s). Second, a new scoring system is devised, which uses charge-dependent mass shifts for peak matching and combines a probability-based model with the general spectral dot-product for scoring. Third, a target-decoy strategy is used for false discovery rate control. To demonstrate the effectiveness of pMatch, a library search experiment was conducted on a public dataset with over 40 000 spectra in comparison with SpectraST, the most popular library search engine. Additional validations were done on four published datasets including over 150 000 spectra. The results showed that pMatch can effectively identify unanticipated modifications and significantly increase spectral identification rate. Availability: http://pfind.ict.ac.cn/pmatch/ Contact: yfu@ict.ac.cn; rxsun@ict.ac.cn Supplementary information: Supplementary data are available at Bioinformatics online.
format Text
id pubmed-2881370
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-28813702010-06-08 Open MS/MS spectral library search to identify unanticipated post-translational modifications and increase spectral identification rate Ye, Ding Fu, Yan Sun, Rui-Xiang Wang, Hai-Peng Yuan, Zuo-Fei Chi, Hao He, Si-Min Bioinformatics Ismb 2010 Conference Proceedings July 11 to July 13, 2010, Boston, Ma, Usa Motivation: Identification of post-translationally modified proteins has become one of the central issues of current proteomics. Spectral library search is a new and promising computational approach to mass spectrometry-based protein identification. However, its potential in identification of unanticipated post-translational modifications has rarely been explored. The existing spectral library search tools are designed to match the query spectrum to the reference library spectra with the same peptide mass. Thus, spectra of peptides with unanticipated modifications cannot be identified. Results: In this article, we present an open spectral library search tool, named pMatch. It extends the existing library search algorithms in at least three aspects to support the identification of unanticipated modifications. First, the spectra in library are optimized with the full peptide sequence information to better tolerate the peptide fragmentation pattern variations caused by some modification(s). Second, a new scoring system is devised, which uses charge-dependent mass shifts for peak matching and combines a probability-based model with the general spectral dot-product for scoring. Third, a target-decoy strategy is used for false discovery rate control. To demonstrate the effectiveness of pMatch, a library search experiment was conducted on a public dataset with over 40 000 spectra in comparison with SpectraST, the most popular library search engine. Additional validations were done on four published datasets including over 150 000 spectra. The results showed that pMatch can effectively identify unanticipated modifications and significantly increase spectral identification rate. Availability: http://pfind.ict.ac.cn/pmatch/ Contact: yfu@ict.ac.cn; rxsun@ict.ac.cn Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2010-06-15 2010-06-01 /pmc/articles/PMC2881370/ /pubmed/20529934 http://dx.doi.org/10.1093/bioinformatics/btq185 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Ismb 2010 Conference Proceedings July 11 to July 13, 2010, Boston, Ma, Usa
Ye, Ding
Fu, Yan
Sun, Rui-Xiang
Wang, Hai-Peng
Yuan, Zuo-Fei
Chi, Hao
He, Si-Min
Open MS/MS spectral library search to identify unanticipated post-translational modifications and increase spectral identification rate
title Open MS/MS spectral library search to identify unanticipated post-translational modifications and increase spectral identification rate
title_full Open MS/MS spectral library search to identify unanticipated post-translational modifications and increase spectral identification rate
title_fullStr Open MS/MS spectral library search to identify unanticipated post-translational modifications and increase spectral identification rate
title_full_unstemmed Open MS/MS spectral library search to identify unanticipated post-translational modifications and increase spectral identification rate
title_short Open MS/MS spectral library search to identify unanticipated post-translational modifications and increase spectral identification rate
title_sort open ms/ms spectral library search to identify unanticipated post-translational modifications and increase spectral identification rate
topic Ismb 2010 Conference Proceedings July 11 to July 13, 2010, Boston, Ma, Usa
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2881370/
https://www.ncbi.nlm.nih.gov/pubmed/20529934
http://dx.doi.org/10.1093/bioinformatics/btq185
work_keys_str_mv AT yeding openmsmsspectrallibrarysearchtoidentifyunanticipatedposttranslationalmodificationsandincreasespectralidentificationrate
AT fuyan openmsmsspectrallibrarysearchtoidentifyunanticipatedposttranslationalmodificationsandincreasespectralidentificationrate
AT sunruixiang openmsmsspectrallibrarysearchtoidentifyunanticipatedposttranslationalmodificationsandincreasespectralidentificationrate
AT wanghaipeng openmsmsspectrallibrarysearchtoidentifyunanticipatedposttranslationalmodificationsandincreasespectralidentificationrate
AT yuanzuofei openmsmsspectrallibrarysearchtoidentifyunanticipatedposttranslationalmodificationsandincreasespectralidentificationrate
AT chihao openmsmsspectrallibrarysearchtoidentifyunanticipatedposttranslationalmodificationsandincreasespectralidentificationrate
AT hesimin openmsmsspectrallibrarysearchtoidentifyunanticipatedposttranslationalmodificationsandincreasespectralidentificationrate