Cargando…

Mistle: bringing spectral library predictions to metaproteomics with an efficient search index

MOTIVATION: Deep learning has moved to the forefront of tandem mass spectrometry-driven proteomics and authentic prediction for peptide fragmentation is more feasible than ever. Still, at this point spectral prediction is mainly used to validate database search results or for confined search spaces....

Descripción completa

Detalles Bibliográficos
Autores principales: Nowatzky, Yannek, Benner, Philipp, Reinert, Knut, Muth, Thilo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10313348/
https://www.ncbi.nlm.nih.gov/pubmed/37294786
http://dx.doi.org/10.1093/bioinformatics/btad376
_version_ 1785067107155705856
author Nowatzky, Yannek
Benner, Philipp
Reinert, Knut
Muth, Thilo
author_facet Nowatzky, Yannek
Benner, Philipp
Reinert, Knut
Muth, Thilo
author_sort Nowatzky, Yannek
collection PubMed
description MOTIVATION: Deep learning has moved to the forefront of tandem mass spectrometry-driven proteomics and authentic prediction for peptide fragmentation is more feasible than ever. Still, at this point spectral prediction is mainly used to validate database search results or for confined search spaces. Fully predicted spectral libraries have not yet been efficiently adapted to large search space problems that often occur in metaproteomics or proteogenomics. RESULTS: In this study, we showcase a workflow that uses Prosit for spectral library predictions on two common metaproteomes and implement an indexing and search algorithm, Mistle, to efficiently identify experimental mass spectra within the library. Hence, the workflow emulates a classic protein sequence database search with protein digestion but builds a searchable index from spectral predictions as an in-between step. We compare Mistle to popular search engines, both on a spectral and database search level, and provide evidence that this approach is more accurate than a database search using MSFragger. Mistle outperforms other spectral library search engines in terms of run time and proves to be extremely memory efficient with a 4- to 22-fold decrease in RAM usage. This makes Mistle universally applicable to large search spaces, e.g. covering comprehensive sequence databases of diverse microbiomes. AVAILABILITY AND IMPLEMENTATION: Mistle is freely available on GitHub at https://github.com/BAMeScience/Mistle.
format Online
Article
Text
id pubmed-10313348
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-103133482023-07-01 Mistle: bringing spectral library predictions to metaproteomics with an efficient search index Nowatzky, Yannek Benner, Philipp Reinert, Knut Muth, Thilo Bioinformatics Original Paper MOTIVATION: Deep learning has moved to the forefront of tandem mass spectrometry-driven proteomics and authentic prediction for peptide fragmentation is more feasible than ever. Still, at this point spectral prediction is mainly used to validate database search results or for confined search spaces. Fully predicted spectral libraries have not yet been efficiently adapted to large search space problems that often occur in metaproteomics or proteogenomics. RESULTS: In this study, we showcase a workflow that uses Prosit for spectral library predictions on two common metaproteomes and implement an indexing and search algorithm, Mistle, to efficiently identify experimental mass spectra within the library. Hence, the workflow emulates a classic protein sequence database search with protein digestion but builds a searchable index from spectral predictions as an in-between step. We compare Mistle to popular search engines, both on a spectral and database search level, and provide evidence that this approach is more accurate than a database search using MSFragger. Mistle outperforms other spectral library search engines in terms of run time and proves to be extremely memory efficient with a 4- to 22-fold decrease in RAM usage. This makes Mistle universally applicable to large search spaces, e.g. covering comprehensive sequence databases of diverse microbiomes. AVAILABILITY AND IMPLEMENTATION: Mistle is freely available on GitHub at https://github.com/BAMeScience/Mistle. Oxford University Press 2023-06-09 /pmc/articles/PMC10313348/ /pubmed/37294786 http://dx.doi.org/10.1093/bioinformatics/btad376 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Nowatzky, Yannek
Benner, Philipp
Reinert, Knut
Muth, Thilo
Mistle: bringing spectral library predictions to metaproteomics with an efficient search index
title Mistle: bringing spectral library predictions to metaproteomics with an efficient search index
title_full Mistle: bringing spectral library predictions to metaproteomics with an efficient search index
title_fullStr Mistle: bringing spectral library predictions to metaproteomics with an efficient search index
title_full_unstemmed Mistle: bringing spectral library predictions to metaproteomics with an efficient search index
title_short Mistle: bringing spectral library predictions to metaproteomics with an efficient search index
title_sort mistle: bringing spectral library predictions to metaproteomics with an efficient search index
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10313348/
https://www.ncbi.nlm.nih.gov/pubmed/37294786
http://dx.doi.org/10.1093/bioinformatics/btad376
work_keys_str_mv AT nowatzkyyannek mistlebringingspectrallibrarypredictionstometaproteomicswithanefficientsearchindex
AT bennerphilipp mistlebringingspectrallibrarypredictionstometaproteomicswithanefficientsearchindex
AT reinertknut mistlebringingspectrallibrarypredictionstometaproteomicswithanefficientsearchindex
AT muththilo mistlebringingspectrallibrarypredictionstometaproteomicswithanefficientsearchindex