Cargando…
FoodKG: A Tool to Enrich Knowledge Graphs Using Machine Learning Techniques
While there exist a plethora of datasets on the Internet related to Food, Energy, and Water (FEW), there is a real lack of reliable methods and tools that can consume these resources. This hinders the development of novel decision-making applications utilizing knowledge graphs. In this paper, we int...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7931944/ https://www.ncbi.nlm.nih.gov/pubmed/33693387 http://dx.doi.org/10.3389/fdata.2020.00012 |
_version_ | 1783660388565385216 |
---|---|
author | Gharibi, Mohamed Zachariah, Arun Rao, Praveen |
author_facet | Gharibi, Mohamed Zachariah, Arun Rao, Praveen |
author_sort | Gharibi, Mohamed |
collection | PubMed |
description | While there exist a plethora of datasets on the Internet related to Food, Energy, and Water (FEW), there is a real lack of reliable methods and tools that can consume these resources. This hinders the development of novel decision-making applications utilizing knowledge graphs. In this paper, we introduce a novel software tool, called FoodKG, that enriches FEW knowledge graphs using advanced machine learning techniques. Our overarching goal is to improve decision-making and knowledge discovery as well as to provide improved search results for data scientists in the FEW domains. Given an input knowledge graph (constructed on raw FEW datasets), FoodKG enriches it with semantically related triples, relations, and images based on the original dataset terms and classes. FoodKG employs an existing graph embedding technique trained on a controlled vocabulary called AGROVOC, which is published by the Food and Agriculture Organization of the United Nations. AGROVOC includes terms and classes in the agriculture and food domains. As a result, FoodKG can enhance knowledge graphs with semantic similarity scores and relations between different classes, classify the existing entities, and allow FEW experts and researchers to use scientific terms for describing FEW concepts. The resulting model obtained after training on AGROVOC was evaluated against the state-of-the-art word embedding and knowledge graph embedding models that were trained on the same dataset. We observed that this model outperformed its competitors based on the Spearman Correlation Coefficient score. |
format | Online Article Text |
id | pubmed-7931944 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-79319442021-03-09 FoodKG: A Tool to Enrich Knowledge Graphs Using Machine Learning Techniques Gharibi, Mohamed Zachariah, Arun Rao, Praveen Front Big Data Big Data While there exist a plethora of datasets on the Internet related to Food, Energy, and Water (FEW), there is a real lack of reliable methods and tools that can consume these resources. This hinders the development of novel decision-making applications utilizing knowledge graphs. In this paper, we introduce a novel software tool, called FoodKG, that enriches FEW knowledge graphs using advanced machine learning techniques. Our overarching goal is to improve decision-making and knowledge discovery as well as to provide improved search results for data scientists in the FEW domains. Given an input knowledge graph (constructed on raw FEW datasets), FoodKG enriches it with semantically related triples, relations, and images based on the original dataset terms and classes. FoodKG employs an existing graph embedding technique trained on a controlled vocabulary called AGROVOC, which is published by the Food and Agriculture Organization of the United Nations. AGROVOC includes terms and classes in the agriculture and food domains. As a result, FoodKG can enhance knowledge graphs with semantic similarity scores and relations between different classes, classify the existing entities, and allow FEW experts and researchers to use scientific terms for describing FEW concepts. The resulting model obtained after training on AGROVOC was evaluated against the state-of-the-art word embedding and knowledge graph embedding models that were trained on the same dataset. We observed that this model outperformed its competitors based on the Spearman Correlation Coefficient score. Frontiers Media S.A. 2020-04-29 /pmc/articles/PMC7931944/ /pubmed/33693387 http://dx.doi.org/10.3389/fdata.2020.00012 Text en Copyright © 2020 Gharibi, Zachariah and Rao. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Big Data Gharibi, Mohamed Zachariah, Arun Rao, Praveen FoodKG: A Tool to Enrich Knowledge Graphs Using Machine Learning Techniques |
title | FoodKG: A Tool to Enrich Knowledge Graphs Using Machine Learning Techniques |
title_full | FoodKG: A Tool to Enrich Knowledge Graphs Using Machine Learning Techniques |
title_fullStr | FoodKG: A Tool to Enrich Knowledge Graphs Using Machine Learning Techniques |
title_full_unstemmed | FoodKG: A Tool to Enrich Knowledge Graphs Using Machine Learning Techniques |
title_short | FoodKG: A Tool to Enrich Knowledge Graphs Using Machine Learning Techniques |
title_sort | foodkg: a tool to enrich knowledge graphs using machine learning techniques |
topic | Big Data |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7931944/ https://www.ncbi.nlm.nih.gov/pubmed/33693387 http://dx.doi.org/10.3389/fdata.2020.00012 |
work_keys_str_mv | AT gharibimohamed foodkgatooltoenrichknowledgegraphsusingmachinelearningtechniques AT zachariaharun foodkgatooltoenrichknowledgegraphsusingmachinelearningtechniques AT raopraveen foodkgatooltoenrichknowledgegraphsusingmachinelearningtechniques |