Cargando…

Envemind: Accurate Monoisotopic Mass Determination Based On Isotopic Envelope

[Image: see text] Nowadays, monoisotopic mass is used as an important feature in top-down proteomics. Knowing the exact monoisotopic mass is helpful for precise and quick protein identification in large protein databases. However, only in spectra of small molecules the monoisotopic peak is visible....

Descripción completa

Detalles Bibliográficos
Autores principales: Radziński, Piotr, Valkenborg, Dirk, Startek, Michał Piotr, Gambin, Anna
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2022
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9634886/
https://www.ncbi.nlm.nih.gov/pubmed/36223196
http://dx.doi.org/10.1021/jasms.2c00176
_version_ 1784824597447704576
author Radziński, Piotr
Valkenborg, Dirk
Startek, Michał Piotr
Gambin, Anna
author_facet Radziński, Piotr
Valkenborg, Dirk
Startek, Michał Piotr
Gambin, Anna
author_sort Radziński, Piotr
collection PubMed
description [Image: see text] Nowadays, monoisotopic mass is used as an important feature in top-down proteomics. Knowing the exact monoisotopic mass is helpful for precise and quick protein identification in large protein databases. However, only in spectra of small molecules the monoisotopic peak is visible. For bigger molecules like proteins, it is hidden in noise or undetected at all, and therefore its position has to be predicted. By improving the prediction of the peak, we contribute to a more accurate identification of molecules, which is crucial in fields such as chemistry and medicine. In this work, we present the envemind algorithm, which is a two-step procedure to predict monoisotopic masses of proteins. The prediction is based on an isotopic envelope. Therefore, envemind is dedicated to spectra where we are able to resolve the one dalton separated isotopic variants. Furthermore, only single-molecule spectra are allowed, that is, spectra that do not require prior deconvolution. The algorithm deals with the problem of off-by-one dalton errors, which are common in monoisotopic mass prediction. A novel aspect of this work is a mathematical exploration of the space of molecules, where we equate chemical formulas and their theoretical spectrum. Since the space of molecules consists of all possible chemical formulas, this approach is not limited to known substances only. This makes optimization processes faster and enables to approximate theoretical spectrum for a given experimental one. The algorithm is available as a Python package envemind on our GitHub page https://github.com/PiotrRadzinski/envemind.
format Online
Article
Text
id pubmed-9634886
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-96348862022-11-05 Envemind: Accurate Monoisotopic Mass Determination Based On Isotopic Envelope Radziński, Piotr Valkenborg, Dirk Startek, Michał Piotr Gambin, Anna J Am Soc Mass Spectrom [Image: see text] Nowadays, monoisotopic mass is used as an important feature in top-down proteomics. Knowing the exact monoisotopic mass is helpful for precise and quick protein identification in large protein databases. However, only in spectra of small molecules the monoisotopic peak is visible. For bigger molecules like proteins, it is hidden in noise or undetected at all, and therefore its position has to be predicted. By improving the prediction of the peak, we contribute to a more accurate identification of molecules, which is crucial in fields such as chemistry and medicine. In this work, we present the envemind algorithm, which is a two-step procedure to predict monoisotopic masses of proteins. The prediction is based on an isotopic envelope. Therefore, envemind is dedicated to spectra where we are able to resolve the one dalton separated isotopic variants. Furthermore, only single-molecule spectra are allowed, that is, spectra that do not require prior deconvolution. The algorithm deals with the problem of off-by-one dalton errors, which are common in monoisotopic mass prediction. A novel aspect of this work is a mathematical exploration of the space of molecules, where we equate chemical formulas and their theoretical spectrum. Since the space of molecules consists of all possible chemical formulas, this approach is not limited to known substances only. This makes optimization processes faster and enables to approximate theoretical spectrum for a given experimental one. The algorithm is available as a Python package envemind on our GitHub page https://github.com/PiotrRadzinski/envemind. American Chemical Society 2022-10-12 2022-11-02 /pmc/articles/PMC9634886/ /pubmed/36223196 http://dx.doi.org/10.1021/jasms.2c00176 Text en © 2022 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Radziński, Piotr
Valkenborg, Dirk
Startek, Michał Piotr
Gambin, Anna
Envemind: Accurate Monoisotopic Mass Determination Based On Isotopic Envelope
title Envemind: Accurate Monoisotopic Mass Determination Based On Isotopic Envelope
title_full Envemind: Accurate Monoisotopic Mass Determination Based On Isotopic Envelope
title_fullStr Envemind: Accurate Monoisotopic Mass Determination Based On Isotopic Envelope
title_full_unstemmed Envemind: Accurate Monoisotopic Mass Determination Based On Isotopic Envelope
title_short Envemind: Accurate Monoisotopic Mass Determination Based On Isotopic Envelope
title_sort envemind: accurate monoisotopic mass determination based on isotopic envelope
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9634886/
https://www.ncbi.nlm.nih.gov/pubmed/36223196
http://dx.doi.org/10.1021/jasms.2c00176
work_keys_str_mv AT radzinskipiotr envemindaccuratemonoisotopicmassdeterminationbasedonisotopicenvelope
AT valkenborgdirk envemindaccuratemonoisotopicmassdeterminationbasedonisotopicenvelope
AT startekmichałpiotr envemindaccuratemonoisotopicmassdeterminationbasedonisotopicenvelope
AT gambinanna envemindaccuratemonoisotopicmassdeterminationbasedonisotopicenvelope