Cargando…

Fragmentation trees reloaded

BACKGROUND: Untargeted metabolomics commonly uses liquid chromatography mass spectrometry to measure abundances of metabolites; subsequent tandem mass spectrometry is used to derive information about individual compounds. One of the bottlenecks in this experimental setup is the interpretation of fra...

Descripción completa

Detalles Bibliográficos
Autores principales: Böcker, Sebastian, Dührkop, Kai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4736045/
https://www.ncbi.nlm.nih.gov/pubmed/26839597
http://dx.doi.org/10.1186/s13321-016-0116-8
_version_ 1782413197697875968
author Böcker, Sebastian
Dührkop, Kai
author_facet Böcker, Sebastian
Dührkop, Kai
author_sort Böcker, Sebastian
collection PubMed
description BACKGROUND: Untargeted metabolomics commonly uses liquid chromatography mass spectrometry to measure abundances of metabolites; subsequent tandem mass spectrometry is used to derive information about individual compounds. One of the bottlenecks in this experimental setup is the interpretation of fragmentation spectra to accurately and efficiently identify compounds. Fragmentation trees have become a powerful tool for the interpretation of tandem mass spectrometry data of small molecules. These trees are determined from the data using combinatorial optimization, and aim at explaining the experimental data via fragmentation cascades. Fragmentation tree computation does not require spectral or structural databases. To obtain biochemically meaningful trees, one needs an elaborate optimization function (scoring). RESULTS: We present a new scoring for computing fragmentation trees, transforming the combinatorial optimization into a Maximum A Posteriori estimator. We demonstrate the superiority of the new scoring for two tasks: both for the de novo identification of molecular formulas of unknown compounds, and for searching a database for structurally similar compounds, our method SIRIUS 3, performs significantly better than the previous version of our method, as well as other methods for this task. CONCLUSION: SIRIUS 3 can be a part of an untargeted metabolomics workflow, allowing researchers to investigate unknowns using automated computational methods. [Figure: see text]
format Online
Article
Text
id pubmed-4736045
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-47360452016-02-03 Fragmentation trees reloaded Böcker, Sebastian Dührkop, Kai J Cheminform Research Article BACKGROUND: Untargeted metabolomics commonly uses liquid chromatography mass spectrometry to measure abundances of metabolites; subsequent tandem mass spectrometry is used to derive information about individual compounds. One of the bottlenecks in this experimental setup is the interpretation of fragmentation spectra to accurately and efficiently identify compounds. Fragmentation trees have become a powerful tool for the interpretation of tandem mass spectrometry data of small molecules. These trees are determined from the data using combinatorial optimization, and aim at explaining the experimental data via fragmentation cascades. Fragmentation tree computation does not require spectral or structural databases. To obtain biochemically meaningful trees, one needs an elaborate optimization function (scoring). RESULTS: We present a new scoring for computing fragmentation trees, transforming the combinatorial optimization into a Maximum A Posteriori estimator. We demonstrate the superiority of the new scoring for two tasks: both for the de novo identification of molecular formulas of unknown compounds, and for searching a database for structurally similar compounds, our method SIRIUS 3, performs significantly better than the previous version of our method, as well as other methods for this task. CONCLUSION: SIRIUS 3 can be a part of an untargeted metabolomics workflow, allowing researchers to investigate unknowns using automated computational methods. [Figure: see text] Springer International Publishing 2016-02-01 /pmc/articles/PMC4736045/ /pubmed/26839597 http://dx.doi.org/10.1186/s13321-016-0116-8 Text en © Böcker and Dührkop. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Böcker, Sebastian
Dührkop, Kai
Fragmentation trees reloaded
title Fragmentation trees reloaded
title_full Fragmentation trees reloaded
title_fullStr Fragmentation trees reloaded
title_full_unstemmed Fragmentation trees reloaded
title_short Fragmentation trees reloaded
title_sort fragmentation trees reloaded
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4736045/
https://www.ncbi.nlm.nih.gov/pubmed/26839597
http://dx.doi.org/10.1186/s13321-016-0116-8
work_keys_str_mv AT bockersebastian fragmentationtreesreloaded
AT duhrkopkai fragmentationtreesreloaded