Cargando…
pNovo 3: precise de novo peptide sequencing using a learning-to-rank framework
MOTIVATION: De novo peptide sequencing based on tandem mass spectrometry data is the key technology of shotgun proteomics for identifying peptides without any database and assembling unknown proteins. However, owing to the low ion coverage in tandem mass spectra, the order of certain consecutive ami...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6612832/ https://www.ncbi.nlm.nih.gov/pubmed/31510687 http://dx.doi.org/10.1093/bioinformatics/btz366 |
_version_ | 1783432947065421824 |
---|---|
author | Yang, Hao Chi, Hao Zeng, Wen-Feng Zhou, Wen-Jing He, Si-Min |
author_facet | Yang, Hao Chi, Hao Zeng, Wen-Feng Zhou, Wen-Jing He, Si-Min |
author_sort | Yang, Hao |
collection | PubMed |
description | MOTIVATION: De novo peptide sequencing based on tandem mass spectrometry data is the key technology of shotgun proteomics for identifying peptides without any database and assembling unknown proteins. However, owing to the low ion coverage in tandem mass spectra, the order of certain consecutive amino acids cannot be determined if all of their supporting fragment ions are missing, which results in the low precision of de novo sequencing. RESULTS: In order to solve this problem, we developed pNovo 3, which used a learning-to-rank framework to distinguish similar peptide candidates for each spectrum. Three metrics for measuring the similarity between each experimental spectrum and its corresponding theoretical spectrum were used as important features, in which the theoretical spectra can be precisely predicted by the pDeep algorithm using deep learning. On seven benchmark datasets from six diverse species, pNovo 3 recalled 29–102% more correct spectra, and the precision was 11–89% higher than three other state-of-the-art de novo sequencing algorithms. Furthermore, compared with the newly developed DeepNovo, which also used the deep learning approach, pNovo 3 still identified 21–50% more spectra on the nine datasets used in the study of DeepNovo. In summary, the deep learning and learning-to-rank techniques implemented in pNovo 3 significantly improve the precision of de novo sequencing, and such machine learning framework is worth extending to other related research fields to distinguish the similar sequences. AVAILABILITY AND IMPLEMENTATION: pNovo 3 can be freely downloaded from http://pfind.ict.ac.cn/software/pNovo/index.html. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-6612832 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-66128322019-07-12 pNovo 3: precise de novo peptide sequencing using a learning-to-rank framework Yang, Hao Chi, Hao Zeng, Wen-Feng Zhou, Wen-Jing He, Si-Min Bioinformatics Ismb/Eccb 2019 Conference Proceedings MOTIVATION: De novo peptide sequencing based on tandem mass spectrometry data is the key technology of shotgun proteomics for identifying peptides without any database and assembling unknown proteins. However, owing to the low ion coverage in tandem mass spectra, the order of certain consecutive amino acids cannot be determined if all of their supporting fragment ions are missing, which results in the low precision of de novo sequencing. RESULTS: In order to solve this problem, we developed pNovo 3, which used a learning-to-rank framework to distinguish similar peptide candidates for each spectrum. Three metrics for measuring the similarity between each experimental spectrum and its corresponding theoretical spectrum were used as important features, in which the theoretical spectra can be precisely predicted by the pDeep algorithm using deep learning. On seven benchmark datasets from six diverse species, pNovo 3 recalled 29–102% more correct spectra, and the precision was 11–89% higher than three other state-of-the-art de novo sequencing algorithms. Furthermore, compared with the newly developed DeepNovo, which also used the deep learning approach, pNovo 3 still identified 21–50% more spectra on the nine datasets used in the study of DeepNovo. In summary, the deep learning and learning-to-rank techniques implemented in pNovo 3 significantly improve the precision of de novo sequencing, and such machine learning framework is worth extending to other related research fields to distinguish the similar sequences. AVAILABILITY AND IMPLEMENTATION: pNovo 3 can be freely downloaded from http://pfind.ict.ac.cn/software/pNovo/index.html. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2019-07 2019-07-05 /pmc/articles/PMC6612832/ /pubmed/31510687 http://dx.doi.org/10.1093/bioinformatics/btz366 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Ismb/Eccb 2019 Conference Proceedings Yang, Hao Chi, Hao Zeng, Wen-Feng Zhou, Wen-Jing He, Si-Min pNovo 3: precise de novo peptide sequencing using a learning-to-rank framework |
title | pNovo 3: precise de novo peptide sequencing using a learning-to-rank framework |
title_full | pNovo 3: precise de novo peptide sequencing using a learning-to-rank framework |
title_fullStr | pNovo 3: precise de novo peptide sequencing using a learning-to-rank framework |
title_full_unstemmed | pNovo 3: precise de novo peptide sequencing using a learning-to-rank framework |
title_short | pNovo 3: precise de novo peptide sequencing using a learning-to-rank framework |
title_sort | pnovo 3: precise de novo peptide sequencing using a learning-to-rank framework |
topic | Ismb/Eccb 2019 Conference Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6612832/ https://www.ncbi.nlm.nih.gov/pubmed/31510687 http://dx.doi.org/10.1093/bioinformatics/btz366 |
work_keys_str_mv | AT yanghao pnovo3precisedenovopeptidesequencingusingalearningtorankframework AT chihao pnovo3precisedenovopeptidesequencingusingalearningtorankframework AT zengwenfeng pnovo3precisedenovopeptidesequencingusingalearningtorankframework AT zhouwenjing pnovo3precisedenovopeptidesequencingusingalearningtorankframework AT hesimin pnovo3precisedenovopeptidesequencingusingalearningtorankframework |