Cargando…

High-confidence structural annotation of metabolites absent from spectral libraries

Untargeted metabolomics experiments rely on spectral libraries for structure annotation, but, typically, only a small fraction of spectra can be matched. Previous in silico methods search in structure databases but cannot distinguish between correct and incorrect annotations. Here we introduce the C...

Descripción completa

Detalles Bibliográficos
Autores principales: Hoffmann, Martin A., Nothias, Louis-Félix, Ludwig, Marcus, Fleischauer, Markus, Gentry, Emily C., Witting, Michael, Dorrestein, Pieter C., Dührkop, Kai, Böcker, Sebastian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group US 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8926923/
https://www.ncbi.nlm.nih.gov/pubmed/34650271
http://dx.doi.org/10.1038/s41587-021-01045-9
_version_ 1784670338649423872
author Hoffmann, Martin A.
Nothias, Louis-Félix
Ludwig, Marcus
Fleischauer, Markus
Gentry, Emily C.
Witting, Michael
Dorrestein, Pieter C.
Dührkop, Kai
Böcker, Sebastian
author_facet Hoffmann, Martin A.
Nothias, Louis-Félix
Ludwig, Marcus
Fleischauer, Markus
Gentry, Emily C.
Witting, Michael
Dorrestein, Pieter C.
Dührkop, Kai
Böcker, Sebastian
author_sort Hoffmann, Martin A.
collection PubMed
description Untargeted metabolomics experiments rely on spectral libraries for structure annotation, but, typically, only a small fraction of spectra can be matched. Previous in silico methods search in structure databases but cannot distinguish between correct and incorrect annotations. Here we introduce the COSMIC workflow that combines in silico structure database generation and annotation with a confidence score consisting of kernel density P value estimation and a support vector machine with enforced directionality of features. On diverse datasets, COSMIC annotates a substantial number of hits at low false discovery rates and outperforms spectral library search. To demonstrate that COSMIC can annotate structures never reported before, we annotated 12 natural bile acids. The annotation of nine structures was confirmed by manual evaluation and two structures using synthetic standards. In human samples, we annotated and manually validated 315 molecular structures currently absent from the Human Metabolome Database. Application of COSMIC to data from 17,400 metabolomics experiments led to 1,715 high-confidence structural annotations that were absent from spectral libraries.
format Online
Article
Text
id pubmed-8926923
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group US
record_format MEDLINE/PubMed
spelling pubmed-89269232022-04-01 High-confidence structural annotation of metabolites absent from spectral libraries Hoffmann, Martin A. Nothias, Louis-Félix Ludwig, Marcus Fleischauer, Markus Gentry, Emily C. Witting, Michael Dorrestein, Pieter C. Dührkop, Kai Böcker, Sebastian Nat Biotechnol Article Untargeted metabolomics experiments rely on spectral libraries for structure annotation, but, typically, only a small fraction of spectra can be matched. Previous in silico methods search in structure databases but cannot distinguish between correct and incorrect annotations. Here we introduce the COSMIC workflow that combines in silico structure database generation and annotation with a confidence score consisting of kernel density P value estimation and a support vector machine with enforced directionality of features. On diverse datasets, COSMIC annotates a substantial number of hits at low false discovery rates and outperforms spectral library search. To demonstrate that COSMIC can annotate structures never reported before, we annotated 12 natural bile acids. The annotation of nine structures was confirmed by manual evaluation and two structures using synthetic standards. In human samples, we annotated and manually validated 315 molecular structures currently absent from the Human Metabolome Database. Application of COSMIC to data from 17,400 metabolomics experiments led to 1,715 high-confidence structural annotations that were absent from spectral libraries. Nature Publishing Group US 2021-10-14 2022 /pmc/articles/PMC8926923/ /pubmed/34650271 http://dx.doi.org/10.1038/s41587-021-01045-9 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Hoffmann, Martin A.
Nothias, Louis-Félix
Ludwig, Marcus
Fleischauer, Markus
Gentry, Emily C.
Witting, Michael
Dorrestein, Pieter C.
Dührkop, Kai
Böcker, Sebastian
High-confidence structural annotation of metabolites absent from spectral libraries
title High-confidence structural annotation of metabolites absent from spectral libraries
title_full High-confidence structural annotation of metabolites absent from spectral libraries
title_fullStr High-confidence structural annotation of metabolites absent from spectral libraries
title_full_unstemmed High-confidence structural annotation of metabolites absent from spectral libraries
title_short High-confidence structural annotation of metabolites absent from spectral libraries
title_sort high-confidence structural annotation of metabolites absent from spectral libraries
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8926923/
https://www.ncbi.nlm.nih.gov/pubmed/34650271
http://dx.doi.org/10.1038/s41587-021-01045-9
work_keys_str_mv AT hoffmannmartina highconfidencestructuralannotationofmetabolitesabsentfromspectrallibraries
AT nothiaslouisfelix highconfidencestructuralannotationofmetabolitesabsentfromspectrallibraries
AT ludwigmarcus highconfidencestructuralannotationofmetabolitesabsentfromspectrallibraries
AT fleischauermarkus highconfidencestructuralannotationofmetabolitesabsentfromspectrallibraries
AT gentryemilyc highconfidencestructuralannotationofmetabolitesabsentfromspectrallibraries
AT wittingmichael highconfidencestructuralannotationofmetabolitesabsentfromspectrallibraries
AT dorresteinpieterc highconfidencestructuralannotationofmetabolitesabsentfromspectrallibraries
AT duhrkopkai highconfidencestructuralannotationofmetabolitesabsentfromspectrallibraries
AT bockersebastian highconfidencestructuralannotationofmetabolitesabsentfromspectrallibraries