Cargando…

FoodKG: A Tool to Enrich Knowledge Graphs Using Machine Learning Techniques

While there exist a plethora of datasets on the Internet related to Food, Energy, and Water (FEW), there is a real lack of reliable methods and tools that can consume these resources. This hinders the development of novel decision-making applications utilizing knowledge graphs. In this paper, we int...

Descripción completa

Detalles Bibliográficos
Autores principales: Gharibi, Mohamed, Zachariah, Arun, Rao, Praveen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7931944/
https://www.ncbi.nlm.nih.gov/pubmed/33693387
http://dx.doi.org/10.3389/fdata.2020.00012
_version_ 1783660388565385216
author Gharibi, Mohamed
Zachariah, Arun
Rao, Praveen
author_facet Gharibi, Mohamed
Zachariah, Arun
Rao, Praveen
author_sort Gharibi, Mohamed
collection PubMed
description While there exist a plethora of datasets on the Internet related to Food, Energy, and Water (FEW), there is a real lack of reliable methods and tools that can consume these resources. This hinders the development of novel decision-making applications utilizing knowledge graphs. In this paper, we introduce a novel software tool, called FoodKG, that enriches FEW knowledge graphs using advanced machine learning techniques. Our overarching goal is to improve decision-making and knowledge discovery as well as to provide improved search results for data scientists in the FEW domains. Given an input knowledge graph (constructed on raw FEW datasets), FoodKG enriches it with semantically related triples, relations, and images based on the original dataset terms and classes. FoodKG employs an existing graph embedding technique trained on a controlled vocabulary called AGROVOC, which is published by the Food and Agriculture Organization of the United Nations. AGROVOC includes terms and classes in the agriculture and food domains. As a result, FoodKG can enhance knowledge graphs with semantic similarity scores and relations between different classes, classify the existing entities, and allow FEW experts and researchers to use scientific terms for describing FEW concepts. The resulting model obtained after training on AGROVOC was evaluated against the state-of-the-art word embedding and knowledge graph embedding models that were trained on the same dataset. We observed that this model outperformed its competitors based on the Spearman Correlation Coefficient score.
format Online
Article
Text
id pubmed-7931944
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-79319442021-03-09 FoodKG: A Tool to Enrich Knowledge Graphs Using Machine Learning Techniques Gharibi, Mohamed Zachariah, Arun Rao, Praveen Front Big Data Big Data While there exist a plethora of datasets on the Internet related to Food, Energy, and Water (FEW), there is a real lack of reliable methods and tools that can consume these resources. This hinders the development of novel decision-making applications utilizing knowledge graphs. In this paper, we introduce a novel software tool, called FoodKG, that enriches FEW knowledge graphs using advanced machine learning techniques. Our overarching goal is to improve decision-making and knowledge discovery as well as to provide improved search results for data scientists in the FEW domains. Given an input knowledge graph (constructed on raw FEW datasets), FoodKG enriches it with semantically related triples, relations, and images based on the original dataset terms and classes. FoodKG employs an existing graph embedding technique trained on a controlled vocabulary called AGROVOC, which is published by the Food and Agriculture Organization of the United Nations. AGROVOC includes terms and classes in the agriculture and food domains. As a result, FoodKG can enhance knowledge graphs with semantic similarity scores and relations between different classes, classify the existing entities, and allow FEW experts and researchers to use scientific terms for describing FEW concepts. The resulting model obtained after training on AGROVOC was evaluated against the state-of-the-art word embedding and knowledge graph embedding models that were trained on the same dataset. We observed that this model outperformed its competitors based on the Spearman Correlation Coefficient score. Frontiers Media S.A. 2020-04-29 /pmc/articles/PMC7931944/ /pubmed/33693387 http://dx.doi.org/10.3389/fdata.2020.00012 Text en Copyright © 2020 Gharibi, Zachariah and Rao. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Big Data
Gharibi, Mohamed
Zachariah, Arun
Rao, Praveen
FoodKG: A Tool to Enrich Knowledge Graphs Using Machine Learning Techniques
title FoodKG: A Tool to Enrich Knowledge Graphs Using Machine Learning Techniques
title_full FoodKG: A Tool to Enrich Knowledge Graphs Using Machine Learning Techniques
title_fullStr FoodKG: A Tool to Enrich Knowledge Graphs Using Machine Learning Techniques
title_full_unstemmed FoodKG: A Tool to Enrich Knowledge Graphs Using Machine Learning Techniques
title_short FoodKG: A Tool to Enrich Knowledge Graphs Using Machine Learning Techniques
title_sort foodkg: a tool to enrich knowledge graphs using machine learning techniques
topic Big Data
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7931944/
https://www.ncbi.nlm.nih.gov/pubmed/33693387
http://dx.doi.org/10.3389/fdata.2020.00012
work_keys_str_mv AT gharibimohamed foodkgatooltoenrichknowledgegraphsusingmachinelearningtechniques
AT zachariaharun foodkgatooltoenrichknowledgegraphsusingmachinelearningtechniques
AT raopraveen foodkgatooltoenrichknowledgegraphsusingmachinelearningtechniques