Cargando…

Analysis of a large food chemical database: chemical space, diversity, and complexity

Background: Food chemicals are a cornerstone in the food industry. However, its chemical diversity has been explored on a limited basis, for instance, previous analysis of food-related databases were done up to 2,200 molecules. The goal of this work was to quantify the chemical diversity of chemical...

Descripción completa

Detalles Bibliográficos
Autores principales: Naveja, J. Jesús, Rico-Hidalgo, Mariel P., Medina-Franco, José L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: F1000 Research Limited 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6081979/
https://www.ncbi.nlm.nih.gov/pubmed/30135721
http://dx.doi.org/10.12688/f1000research.15440.2
_version_ 1783345741805125632
author Naveja, J. Jesús
Rico-Hidalgo, Mariel P.
Medina-Franco, José L.
author_facet Naveja, J. Jesús
Rico-Hidalgo, Mariel P.
Medina-Franco, José L.
author_sort Naveja, J. Jesús
collection PubMed
description Background: Food chemicals are a cornerstone in the food industry. However, its chemical diversity has been explored on a limited basis, for instance, previous analysis of food-related databases were done up to 2,200 molecules. The goal of this work was to quantify the chemical diversity of chemical compounds stored in FooDB, a database with nearly 24,000 food chemicals. Methods: The visual representation of the chemical space of FooDB was done with ChemMaps, a novel approach based on the concept of chemical satellites. The large food chemical database was profiled based on physicochemical properties, molecular complexity and scaffold content. The global diversity of FooDB was characterized using Consensus Diversity Plots. Results: It was found that compounds in FooDB are very diverse in terms of properties and structure, with a large structural complexity. It was also found that one third of the food chemicals are acyclic molecules and ring-containing molecules are mostly monocyclic, with several scaffolds common to natural products in other databases. Conclusions: To the best of our knowledge, this is the first analysis of the chemical diversity and complexity of FooDB. This study represents a step further to the emerging field of “Food Informatics”. Future study should compare directly the chemical structures of the molecules in FooDB with other compound databases, for instance, drug-like databases and natural products collections. An additional future direction of this work is to use the list of 3,228 polyphenolic compounds identified in this work to enhance the on-going polyphenol-protein interactome studies.
format Online
Article
Text
id pubmed-6081979
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher F1000 Research Limited
record_format MEDLINE/PubMed
spelling pubmed-60819792018-08-21 Analysis of a large food chemical database: chemical space, diversity, and complexity Naveja, J. Jesús Rico-Hidalgo, Mariel P. Medina-Franco, José L. F1000Res Research Article Background: Food chemicals are a cornerstone in the food industry. However, its chemical diversity has been explored on a limited basis, for instance, previous analysis of food-related databases were done up to 2,200 molecules. The goal of this work was to quantify the chemical diversity of chemical compounds stored in FooDB, a database with nearly 24,000 food chemicals. Methods: The visual representation of the chemical space of FooDB was done with ChemMaps, a novel approach based on the concept of chemical satellites. The large food chemical database was profiled based on physicochemical properties, molecular complexity and scaffold content. The global diversity of FooDB was characterized using Consensus Diversity Plots. Results: It was found that compounds in FooDB are very diverse in terms of properties and structure, with a large structural complexity. It was also found that one third of the food chemicals are acyclic molecules and ring-containing molecules are mostly monocyclic, with several scaffolds common to natural products in other databases. Conclusions: To the best of our knowledge, this is the first analysis of the chemical diversity and complexity of FooDB. This study represents a step further to the emerging field of “Food Informatics”. Future study should compare directly the chemical structures of the molecules in FooDB with other compound databases, for instance, drug-like databases and natural products collections. An additional future direction of this work is to use the list of 3,228 polyphenolic compounds identified in this work to enhance the on-going polyphenol-protein interactome studies. F1000 Research Limited 2018-08-10 /pmc/articles/PMC6081979/ /pubmed/30135721 http://dx.doi.org/10.12688/f1000research.15440.2 Text en Copyright: © 2018 Naveja JJ et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Naveja, J. Jesús
Rico-Hidalgo, Mariel P.
Medina-Franco, José L.
Analysis of a large food chemical database: chemical space, diversity, and complexity
title Analysis of a large food chemical database: chemical space, diversity, and complexity
title_full Analysis of a large food chemical database: chemical space, diversity, and complexity
title_fullStr Analysis of a large food chemical database: chemical space, diversity, and complexity
title_full_unstemmed Analysis of a large food chemical database: chemical space, diversity, and complexity
title_short Analysis of a large food chemical database: chemical space, diversity, and complexity
title_sort analysis of a large food chemical database: chemical space, diversity, and complexity
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6081979/
https://www.ncbi.nlm.nih.gov/pubmed/30135721
http://dx.doi.org/10.12688/f1000research.15440.2
work_keys_str_mv AT navejajjesus analysisofalargefoodchemicaldatabasechemicalspacediversityandcomplexity
AT ricohidalgomarielp analysisofalargefoodchemicaldatabasechemicalspacediversityandcomplexity
AT medinafrancojosel analysisofalargefoodchemicaldatabasechemicalspacediversityandcomplexity