Cargando…
High-confidence structural annotation of metabolites absent from spectral libraries
Untargeted metabolomics experiments rely on spectral libraries for structure annotation, but, typically, only a small fraction of spectra can be matched. Previous in silico methods search in structure databases but cannot distinguish between correct and incorrect annotations. Here we introduce the C...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group US
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8926923/ https://www.ncbi.nlm.nih.gov/pubmed/34650271 http://dx.doi.org/10.1038/s41587-021-01045-9 |
_version_ | 1784670338649423872 |
---|---|
author | Hoffmann, Martin A. Nothias, Louis-Félix Ludwig, Marcus Fleischauer, Markus Gentry, Emily C. Witting, Michael Dorrestein, Pieter C. Dührkop, Kai Böcker, Sebastian |
author_facet | Hoffmann, Martin A. Nothias, Louis-Félix Ludwig, Marcus Fleischauer, Markus Gentry, Emily C. Witting, Michael Dorrestein, Pieter C. Dührkop, Kai Böcker, Sebastian |
author_sort | Hoffmann, Martin A. |
collection | PubMed |
description | Untargeted metabolomics experiments rely on spectral libraries for structure annotation, but, typically, only a small fraction of spectra can be matched. Previous in silico methods search in structure databases but cannot distinguish between correct and incorrect annotations. Here we introduce the COSMIC workflow that combines in silico structure database generation and annotation with a confidence score consisting of kernel density P value estimation and a support vector machine with enforced directionality of features. On diverse datasets, COSMIC annotates a substantial number of hits at low false discovery rates and outperforms spectral library search. To demonstrate that COSMIC can annotate structures never reported before, we annotated 12 natural bile acids. The annotation of nine structures was confirmed by manual evaluation and two structures using synthetic standards. In human samples, we annotated and manually validated 315 molecular structures currently absent from the Human Metabolome Database. Application of COSMIC to data from 17,400 metabolomics experiments led to 1,715 high-confidence structural annotations that were absent from spectral libraries. |
format | Online Article Text |
id | pubmed-8926923 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group US |
record_format | MEDLINE/PubMed |
spelling | pubmed-89269232022-04-01 High-confidence structural annotation of metabolites absent from spectral libraries Hoffmann, Martin A. Nothias, Louis-Félix Ludwig, Marcus Fleischauer, Markus Gentry, Emily C. Witting, Michael Dorrestein, Pieter C. Dührkop, Kai Böcker, Sebastian Nat Biotechnol Article Untargeted metabolomics experiments rely on spectral libraries for structure annotation, but, typically, only a small fraction of spectra can be matched. Previous in silico methods search in structure databases but cannot distinguish between correct and incorrect annotations. Here we introduce the COSMIC workflow that combines in silico structure database generation and annotation with a confidence score consisting of kernel density P value estimation and a support vector machine with enforced directionality of features. On diverse datasets, COSMIC annotates a substantial number of hits at low false discovery rates and outperforms spectral library search. To demonstrate that COSMIC can annotate structures never reported before, we annotated 12 natural bile acids. The annotation of nine structures was confirmed by manual evaluation and two structures using synthetic standards. In human samples, we annotated and manually validated 315 molecular structures currently absent from the Human Metabolome Database. Application of COSMIC to data from 17,400 metabolomics experiments led to 1,715 high-confidence structural annotations that were absent from spectral libraries. Nature Publishing Group US 2021-10-14 2022 /pmc/articles/PMC8926923/ /pubmed/34650271 http://dx.doi.org/10.1038/s41587-021-01045-9 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Hoffmann, Martin A. Nothias, Louis-Félix Ludwig, Marcus Fleischauer, Markus Gentry, Emily C. Witting, Michael Dorrestein, Pieter C. Dührkop, Kai Böcker, Sebastian High-confidence structural annotation of metabolites absent from spectral libraries |
title | High-confidence structural annotation of metabolites absent from spectral libraries |
title_full | High-confidence structural annotation of metabolites absent from spectral libraries |
title_fullStr | High-confidence structural annotation of metabolites absent from spectral libraries |
title_full_unstemmed | High-confidence structural annotation of metabolites absent from spectral libraries |
title_short | High-confidence structural annotation of metabolites absent from spectral libraries |
title_sort | high-confidence structural annotation of metabolites absent from spectral libraries |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8926923/ https://www.ncbi.nlm.nih.gov/pubmed/34650271 http://dx.doi.org/10.1038/s41587-021-01045-9 |
work_keys_str_mv | AT hoffmannmartina highconfidencestructuralannotationofmetabolitesabsentfromspectrallibraries AT nothiaslouisfelix highconfidencestructuralannotationofmetabolitesabsentfromspectrallibraries AT ludwigmarcus highconfidencestructuralannotationofmetabolitesabsentfromspectrallibraries AT fleischauermarkus highconfidencestructuralannotationofmetabolitesabsentfromspectrallibraries AT gentryemilyc highconfidencestructuralannotationofmetabolitesabsentfromspectrallibraries AT wittingmichael highconfidencestructuralannotationofmetabolitesabsentfromspectrallibraries AT dorresteinpieterc highconfidencestructuralannotationofmetabolitesabsentfromspectrallibraries AT duhrkopkai highconfidencestructuralannotationofmetabolitesabsentfromspectrallibraries AT bockersebastian highconfidencestructuralannotationofmetabolitesabsentfromspectrallibraries |