Cargando…
Exploring food contents in scientific literature with FoodMine
Thanks to the many chemical and nutritional components it carries, diet critically affects human health. However, the currently available comprehensive databases on food composition cover only a tiny fraction of the total number of chemicals present in our food, focusing on the nutritional component...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7529743/ https://www.ncbi.nlm.nih.gov/pubmed/33004889 http://dx.doi.org/10.1038/s41598-020-73105-0 |
_version_ | 1783589480730460160 |
---|---|
author | Hooton, Forrest Menichetti, Giulia Barabási, Albert-László |
author_facet | Hooton, Forrest Menichetti, Giulia Barabási, Albert-László |
author_sort | Hooton, Forrest |
collection | PubMed |
description | Thanks to the many chemical and nutritional components it carries, diet critically affects human health. However, the currently available comprehensive databases on food composition cover only a tiny fraction of the total number of chemicals present in our food, focusing on the nutritional components essential for our health. Indeed, thousands of other molecules, many of which have well documented health implications, remain untracked. To explore the body of knowledge available on food composition, we built FoodMine, an algorithm that uses natural language processing to identify papers from PubMed that potentially report on the chemical composition of garlic and cocoa. After extracting from each paper information on the reported quantities of chemicals, we find that the scientific literature carries extensive information on the detailed chemical components of food that is currently not integrated in databases. Finally, we use unsupervised machine learning to create chemical embeddings, finding that the chemicals identified by FoodMine tend to have direct health relevance, reflecting the scientific community’s focus on health-related chemicals in our food. |
format | Online Article Text |
id | pubmed-7529743 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-75297432020-10-02 Exploring food contents in scientific literature with FoodMine Hooton, Forrest Menichetti, Giulia Barabási, Albert-László Sci Rep Article Thanks to the many chemical and nutritional components it carries, diet critically affects human health. However, the currently available comprehensive databases on food composition cover only a tiny fraction of the total number of chemicals present in our food, focusing on the nutritional components essential for our health. Indeed, thousands of other molecules, many of which have well documented health implications, remain untracked. To explore the body of knowledge available on food composition, we built FoodMine, an algorithm that uses natural language processing to identify papers from PubMed that potentially report on the chemical composition of garlic and cocoa. After extracting from each paper information on the reported quantities of chemicals, we find that the scientific literature carries extensive information on the detailed chemical components of food that is currently not integrated in databases. Finally, we use unsupervised machine learning to create chemical embeddings, finding that the chemicals identified by FoodMine tend to have direct health relevance, reflecting the scientific community’s focus on health-related chemicals in our food. Nature Publishing Group UK 2020-10-01 /pmc/articles/PMC7529743/ /pubmed/33004889 http://dx.doi.org/10.1038/s41598-020-73105-0 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Hooton, Forrest Menichetti, Giulia Barabási, Albert-László Exploring food contents in scientific literature with FoodMine |
title | Exploring food contents in scientific literature with FoodMine |
title_full | Exploring food contents in scientific literature with FoodMine |
title_fullStr | Exploring food contents in scientific literature with FoodMine |
title_full_unstemmed | Exploring food contents in scientific literature with FoodMine |
title_short | Exploring food contents in scientific literature with FoodMine |
title_sort | exploring food contents in scientific literature with foodmine |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7529743/ https://www.ncbi.nlm.nih.gov/pubmed/33004889 http://dx.doi.org/10.1038/s41598-020-73105-0 |
work_keys_str_mv | AT hootonforrest exploringfoodcontentsinscientificliteraturewithfoodmine AT menichettigiulia exploringfoodcontentsinscientificliteraturewithfoodmine AT barabasialbertlaszlo exploringfoodcontentsinscientificliteraturewithfoodmine |