Multi-Dimensional Dataset of Open Data and Satellite Images for Characterization of Food Security and Nutrition

Bibliographic Details
Main authors: Restrepo, David S., Pérez, Luis E., López, Diego M., Vargas-Cañas, Rubiel, Osorio-Valencia, Juan Sebastian
Format: Online Article Text
Language: English
Published: Frontiers Media S.A. 2022
Subjects: Nutrition
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8828574/
https://www.ncbi.nlm.nih.gov/pubmed/35155518
http://dx.doi.org/10.3389/fnut.2021.796082
author Restrepo, David S.
Pérez, Luis E.
López, Diego M.
Vargas-Cañas, Rubiel
Osorio-Valencia, Juan Sebastian
collection PubMed
description BACKGROUND: Nutrition is one of the main factors affecting a person's development and quality of life. From a public health perspective, food security is an essential social determinant for promoting healthy nutrition. Food security embraces four dimensions: physical availability of food, economic and physical access to food, food utilization, and the sustainability of the dimensions above. Addressing the four dimensions together is vital. Surprisingly, most previous works have focused on a single dimension of food security: the physical availability of food. OBJECTIVE: The paper proposes a multi-dimensional dataset of open data and satellite images to characterize food security in the department of Cauca, Colombia. METHODS: The food security dataset integrates multiple open data sources; therefore, the Cross-Industry Standard Process for Data Mining (CRISP-DM) methodology was used to guide the construction of the dataset. It includes sources such as population and agricultural censuses, nutrition surveys, and satellite images. RESULTS: An open multidimensional dataset for the Department of Cauca, with 926 attributes and 9 rows (each row representing a municipality) drawn from multiple sources in Colombia, was built. Machine learning models were then used to characterize food security and nutrition in the Cauca Department. The food security index calculated for Cauca using a linear regression model (Mean Absolute Error of 0.391) is 57.444 on a scale from 0 to 100, with 100 the best score. In addition, an approach for extracting four features (Agriculture, Habitation, Road, Water) from satellite images was tested; the ResNet50 model trained from scratch performed best, with a macro-accuracy, macro-precision, macro-recall, and macro-F1-score of 91.7, 86.2, 66.91, and 74.92%, respectively. CONCLUSION: The study shows how the CRISP-DM methodology can be used to create an open public health data repository. Furthermore, this methodology could be generalized to other types of problems requiring the creation of a dataset. In addition, the use of satellite images presents an alternative for places where data collection is challenging. The proposed model and methodology, based on open data, offer a low-cost and effective solution that decision-makers, especially in developing countries, could use to support food security planning.
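The abstract reports macro-averaged scores for the four satellite-image features. As a rough illustration of what "macro" averaging means, the sketch below computes each metric per label and then takes the unweighted mean across labels. It assumes a multi-label setup with one binary indicator per feature; the record does not state the paper's exact definitions, and the function name `macro_metrics` and the toy labels are hypothetical, not taken from the article.

```python
from typing import Dict, List

# Feature names as listed in the abstract; the data below is made up.
LABELS = ["Agriculture", "Habitation", "Road", "Water"]


def macro_metrics(y_true: List[List[int]], y_pred: List[List[int]]) -> Dict[str, float]:
    """Compute accuracy, precision, recall and F1 per label, then average them."""
    n_labels = len(y_true[0])
    totals = {"accuracy": 0.0, "precision": 0.0, "recall": 0.0, "f1": 0.0}
    for j in range(n_labels):
        tp = fp = fn = tn = 0
        for t_row, p_row in zip(y_true, y_pred):
            t, p = t_row[j], p_row[j]
            if t == 1 and p == 1:
                tp += 1
            elif t == 0 and p == 1:
                fp += 1
            elif t == 1 and p == 0:
                fn += 1
            else:
                tn += 1
        # Assumption: a metric with an empty denominator counts as 0.0.
        precision = tp / (tp + fp) if (tp + fp) else 0.0
        recall = tp / (tp + fn) if (tp + fn) else 0.0
        f1 = (2 * precision * recall / (precision + recall)
              if (precision + recall) else 0.0)
        totals["accuracy"] += (tp + tn) / (tp + fp + fn + tn)
        totals["precision"] += precision
        totals["recall"] += recall
        totals["f1"] += f1
    # Macro average: every label contributes equally, whatever its frequency.
    return {name: value / n_labels for name, value in totals.items()}


# Hypothetical ground truth and predictions for four images (one column per label).
y_true = [[1, 0, 0, 1], [1, 1, 0, 0], [0, 1, 1, 0], [1, 0, 1, 0]]
y_pred = [[1, 0, 0, 1], [1, 0, 0, 0], [0, 1, 0, 0], [1, 0, 0, 0]]

scores = macro_metrics(y_true, y_pred)
for name, value in scores.items():
    print(f"macro-{name}: {value:.4f}")
```

In this toy data the "Road" label is never predicted, which pulls macro-recall well below macro-accuracy. A gap of that kind between accuracy and recall is the pattern macro averaging is meant to expose.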
format Online
Article
Text
id pubmed-8828574
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-8828574 2022-02-11 Multi-Dimensional Dataset of Open Data and Satellite Images for Characterization of Food Security and Nutrition Restrepo, David S. Pérez, Luis E. López, Diego M. Vargas-Cañas, Rubiel Osorio-Valencia, Juan Sebastian Front Nutr Nutrition Frontiers Media S.A. 2022-01-27 /pmc/articles/PMC8828574/ /pubmed/35155518 http://dx.doi.org/10.3389/fnut.2021.796082 Text en Copyright © 2022 Restrepo, Pérez, López, Vargas-Cañas and Osorio-Valencia. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
title Multi-Dimensional Dataset of Open Data and Satellite Images for Characterization of Food Security and Nutrition
topic Nutrition
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8828574/
https://www.ncbi.nlm.nih.gov/pubmed/35155518
http://dx.doi.org/10.3389/fnut.2021.796082