Cargando…

Exploring food contents in scientific literature with FoodMine

Thanks to the many chemical and nutritional components it carries, diet critically affects human health. However, the currently available comprehensive databases on food composition cover only a tiny fraction of the total number of chemicals present in our food, focusing on the nutritional component...

Descripción completa

Detalles Bibliográficos
Autores principales: Hooton, Forrest, Menichetti, Giulia, Barabási, Albert-László
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7529743/
https://www.ncbi.nlm.nih.gov/pubmed/33004889
http://dx.doi.org/10.1038/s41598-020-73105-0
_version_ 1783589480730460160
author Hooton, Forrest
Menichetti, Giulia
Barabási, Albert-László
author_facet Hooton, Forrest
Menichetti, Giulia
Barabási, Albert-László
author_sort Hooton, Forrest
collection PubMed
description Thanks to the many chemical and nutritional components it carries, diet critically affects human health. However, the currently available comprehensive databases on food composition cover only a tiny fraction of the total number of chemicals present in our food, focusing on the nutritional components essential for our health. Indeed, thousands of other molecules, many of which have well documented health implications, remain untracked. To explore the body of knowledge available on food composition, we built FoodMine, an algorithm that uses natural language processing to identify papers from PubMed that potentially report on the chemical composition of garlic and cocoa. After extracting from each paper information on the reported quantities of chemicals, we find that the scientific literature carries extensive information on the detailed chemical components of food that is currently not integrated in databases. Finally, we use unsupervised machine learning to create chemical embeddings, finding that the chemicals identified by FoodMine tend to have direct health relevance, reflecting the scientific community’s focus on health-related chemicals in our food.
format Online
Article
Text
id pubmed-7529743
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-75297432020-10-02 Exploring food contents in scientific literature with FoodMine Hooton, Forrest Menichetti, Giulia Barabási, Albert-László Sci Rep Article Thanks to the many chemical and nutritional components it carries, diet critically affects human health. However, the currently available comprehensive databases on food composition cover only a tiny fraction of the total number of chemicals present in our food, focusing on the nutritional components essential for our health. Indeed, thousands of other molecules, many of which have well documented health implications, remain untracked. To explore the body of knowledge available on food composition, we built FoodMine, an algorithm that uses natural language processing to identify papers from PubMed that potentially report on the chemical composition of garlic and cocoa. After extracting from each paper information on the reported quantities of chemicals, we find that the scientific literature carries extensive information on the detailed chemical components of food that is currently not integrated in databases. Finally, we use unsupervised machine learning to create chemical embeddings, finding that the chemicals identified by FoodMine tend to have direct health relevance, reflecting the scientific community’s focus on health-related chemicals in our food. Nature Publishing Group UK 2020-10-01 /pmc/articles/PMC7529743/ /pubmed/33004889 http://dx.doi.org/10.1038/s41598-020-73105-0 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Hooton, Forrest
Menichetti, Giulia
Barabási, Albert-László
Exploring food contents in scientific literature with FoodMine
title Exploring food contents in scientific literature with FoodMine
title_full Exploring food contents in scientific literature with FoodMine
title_fullStr Exploring food contents in scientific literature with FoodMine
title_full_unstemmed Exploring food contents in scientific literature with FoodMine
title_short Exploring food contents in scientific literature with FoodMine
title_sort exploring food contents in scientific literature with foodmine
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7529743/
https://www.ncbi.nlm.nih.gov/pubmed/33004889
http://dx.doi.org/10.1038/s41598-020-73105-0
work_keys_str_mv AT hootonforrest exploringfoodcontentsinscientificliteraturewithfoodmine
AT menichettigiulia exploringfoodcontentsinscientificliteraturewithfoodmine
AT barabasialbertlaszlo exploringfoodcontentsinscientificliteraturewithfoodmine