Cargando…

Mind the Gap: Mapping Mass Spectral Databases in Genome-Scale Metabolic Networks Reveals Poorly Covered Areas

The use of mass spectrometry-based metabolomics to study human, plant and microbial biochemistry and their interactions with the environment largely depends on the ability to annotate metabolite structures by matching mass spectral features of the measured metabolites to curated spectra of reference...

Descripción completa

Detalles Bibliográficos
Autores principales: Frainay, Clément, Schymanski, Emma L., Neumann, Steffen, Merlet, Benjamin, Salek, Reza M., Jourdan, Fabien, Yanes, Oscar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6161000/
https://www.ncbi.nlm.nih.gov/pubmed/30223552
http://dx.doi.org/10.3390/metabo8030051
_version_ 1783358893501448192
author Frainay, Clément
Schymanski, Emma L.
Neumann, Steffen
Merlet, Benjamin
Salek, Reza M.
Jourdan, Fabien
Yanes, Oscar
author_facet Frainay, Clément
Schymanski, Emma L.
Neumann, Steffen
Merlet, Benjamin
Salek, Reza M.
Jourdan, Fabien
Yanes, Oscar
author_sort Frainay, Clément
collection PubMed
description The use of mass spectrometry-based metabolomics to study human, plant and microbial biochemistry and their interactions with the environment largely depends on the ability to annotate metabolite structures by matching mass spectral features of the measured metabolites to curated spectra of reference standards. While reference databases for metabolomics now provide information for hundreds of thousands of compounds, barely 5% of these known small molecules have experimental data from pure standards. Remarkably, it is still unknown how well existing mass spectral libraries cover the biochemical landscape of prokaryotic and eukaryotic organisms. To address this issue, we have investigated the coverage of 38 genome-scale metabolic networks by public and commercial mass spectral databases, and found that on average only 40% of nodes in metabolic networks could be mapped by mass spectral information from standards. Next, we deciphered computationally which parts of the human metabolic network are poorly covered by mass spectral libraries, revealing gaps in the eicosanoids, vitamins and bile acid metabolism. Finally, our network topology analysis based on the betweenness centrality of metabolites revealed the top 20 most important metabolites that, if added to MS databases, may facilitate human metabolome characterization in the future.
format Online
Article
Text
id pubmed-6161000
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-61610002018-09-28 Mind the Gap: Mapping Mass Spectral Databases in Genome-Scale Metabolic Networks Reveals Poorly Covered Areas Frainay, Clément Schymanski, Emma L. Neumann, Steffen Merlet, Benjamin Salek, Reza M. Jourdan, Fabien Yanes, Oscar Metabolites Article The use of mass spectrometry-based metabolomics to study human, plant and microbial biochemistry and their interactions with the environment largely depends on the ability to annotate metabolite structures by matching mass spectral features of the measured metabolites to curated spectra of reference standards. While reference databases for metabolomics now provide information for hundreds of thousands of compounds, barely 5% of these known small molecules have experimental data from pure standards. Remarkably, it is still unknown how well existing mass spectral libraries cover the biochemical landscape of prokaryotic and eukaryotic organisms. To address this issue, we have investigated the coverage of 38 genome-scale metabolic networks by public and commercial mass spectral databases, and found that on average only 40% of nodes in metabolic networks could be mapped by mass spectral information from standards. Next, we deciphered computationally which parts of the human metabolic network are poorly covered by mass spectral libraries, revealing gaps in the eicosanoids, vitamins and bile acid metabolism. Finally, our network topology analysis based on the betweenness centrality of metabolites revealed the top 20 most important metabolites that, if added to MS databases, may facilitate human metabolome characterization in the future. MDPI 2018-09-15 /pmc/articles/PMC6161000/ /pubmed/30223552 http://dx.doi.org/10.3390/metabo8030051 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Frainay, Clément
Schymanski, Emma L.
Neumann, Steffen
Merlet, Benjamin
Salek, Reza M.
Jourdan, Fabien
Yanes, Oscar
Mind the Gap: Mapping Mass Spectral Databases in Genome-Scale Metabolic Networks Reveals Poorly Covered Areas
title Mind the Gap: Mapping Mass Spectral Databases in Genome-Scale Metabolic Networks Reveals Poorly Covered Areas
title_full Mind the Gap: Mapping Mass Spectral Databases in Genome-Scale Metabolic Networks Reveals Poorly Covered Areas
title_fullStr Mind the Gap: Mapping Mass Spectral Databases in Genome-Scale Metabolic Networks Reveals Poorly Covered Areas
title_full_unstemmed Mind the Gap: Mapping Mass Spectral Databases in Genome-Scale Metabolic Networks Reveals Poorly Covered Areas
title_short Mind the Gap: Mapping Mass Spectral Databases in Genome-Scale Metabolic Networks Reveals Poorly Covered Areas
title_sort mind the gap: mapping mass spectral databases in genome-scale metabolic networks reveals poorly covered areas
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6161000/
https://www.ncbi.nlm.nih.gov/pubmed/30223552
http://dx.doi.org/10.3390/metabo8030051
work_keys_str_mv AT frainayclement mindthegapmappingmassspectraldatabasesingenomescalemetabolicnetworksrevealspoorlycoveredareas
AT schymanskiemmal mindthegapmappingmassspectraldatabasesingenomescalemetabolicnetworksrevealspoorlycoveredareas
AT neumannsteffen mindthegapmappingmassspectraldatabasesingenomescalemetabolicnetworksrevealspoorlycoveredareas
AT merletbenjamin mindthegapmappingmassspectraldatabasesingenomescalemetabolicnetworksrevealspoorlycoveredareas
AT salekrezam mindthegapmappingmassspectraldatabasesingenomescalemetabolicnetworksrevealspoorlycoveredareas
AT jourdanfabien mindthegapmappingmassspectraldatabasesingenomescalemetabolicnetworksrevealspoorlycoveredareas
AT yanesoscar mindthegapmappingmassspectraldatabasesingenomescalemetabolicnetworksrevealspoorlycoveredareas