Multi-Dimensional Dataset of Open Data and Satellite Images for Characterization of Food Security and Nutrition
BACKGROUND: Nutrition is one of the main factors affecting the development and quality of life of a person. From a public health perspective, food security is an essential social determinant for promoting healthy nutrition. Food security embraces four dimensions: physical availability of food, econo...
Main Authors: | Restrepo, David S.; Pérez, Luis E.; López, Diego M.; Vargas-Cañas, Rubiel; Osorio-Valencia, Juan Sebastian |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Frontiers Media S.A. 2022 |
Subjects: | Nutrition |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8828574/ https://www.ncbi.nlm.nih.gov/pubmed/35155518 http://dx.doi.org/10.3389/fnut.2021.796082 |
_version_ | 1784647877941788672 |
---|---|
author | Restrepo, David S. Pérez, Luis E. López, Diego M. Vargas-Cañas, Rubiel Osorio-Valencia, Juan Sebastian |
author_facet | Restrepo, David S. Pérez, Luis E. López, Diego M. Vargas-Cañas, Rubiel Osorio-Valencia, Juan Sebastian |
author_sort | Restrepo, David S. |
collection | PubMed |
description | BACKGROUND: Nutrition is one of the main factors affecting the development and quality of life of a person. From a public health perspective, food security is an essential social determinant for promoting healthy nutrition. Food security embraces four dimensions: physical availability of food, economic and physical access to food, food utilization, and the sustainability of the dimensions above. Integrally addressing the four dimensions is vital. Surprisingly, most existing work focuses on a single dimension of food security: the physical availability of food. OBJECTIVE: The paper proposes a multi-dimensional dataset of open data and satellite images to characterize food security in the department of Cauca, Colombia. METHODS: The food security dataset integrates multiple open data sources; therefore, the Cross-Industry Standard Process for Data Mining (CRISP-DM) methodology was used to guide the construction of the dataset. It includes sources such as population and agricultural censuses, nutrition surveys, and satellite images. RESULTS: An open multidimensional dataset for the Department of Cauca, with 926 attributes and 9 rows (each row representing a municipality), was built from multiple sources in Colombia. Machine learning models were then used to characterize food security and nutrition in the Cauca Department. The food security index calculated for Cauca using a linear regression model (mean absolute error of 0.391) is 57.444 on a scale from 0 to 100, where 100 is the best score. In addition, an approach for extracting four features (Agriculture, Habitation, Road, Water) from satellite images was tested, with the ResNet50 model trained from scratch achieving the best performance: a macro-accuracy, macro-precision, macro-recall, and macro-F1-score of 91.7, 86.2, 66.91, and 74.92%, respectively. CONCLUSION: This work shows how the CRISP-DM methodology can be used to create an open public health data repository. Furthermore, this methodology could be generalized to other types of problems requiring the creation of a dataset. In addition, the use of satellite images presents an alternative for places where data collection is challenging. The proposed model and methodology, based on open data, offer a low-cost and effective solution that decision-makers, especially in developing countries, could use to support food security planning. |
format | Online Article Text |
id | pubmed-8828574 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-8828574 2022-02-11 Multi-Dimensional Dataset of Open Data and Satellite Images for Characterization of Food Security and Nutrition Restrepo, David S. Pérez, Luis E. López, Diego M. Vargas-Cañas, Rubiel Osorio-Valencia, Juan Sebastian Front Nutr Nutrition BACKGROUND: Nutrition is one of the main factors affecting the development and quality of life of a person. From a public health perspective, food security is an essential social determinant for promoting healthy nutrition. Food security embraces four dimensions: physical availability of food, economic and physical access to food, food utilization, and the sustainability of the dimensions above. Integrally addressing the four dimensions is vital. Surprisingly, most existing work focuses on a single dimension of food security: the physical availability of food. OBJECTIVE: The paper proposes a multi-dimensional dataset of open data and satellite images to characterize food security in the department of Cauca, Colombia. METHODS: The food security dataset integrates multiple open data sources; therefore, the Cross-Industry Standard Process for Data Mining (CRISP-DM) methodology was used to guide the construction of the dataset. It includes sources such as population and agricultural censuses, nutrition surveys, and satellite images. RESULTS: An open multidimensional dataset for the Department of Cauca, with 926 attributes and 9 rows (each row representing a municipality), was built from multiple sources in Colombia. Machine learning models were then used to characterize food security and nutrition in the Cauca Department. The food security index calculated for Cauca using a linear regression model (mean absolute error of 0.391) is 57.444 on a scale from 0 to 100, where 100 is the best score. In addition, an approach for extracting four features (Agriculture, Habitation, Road, Water) from satellite images was tested, with the ResNet50 model trained from scratch achieving the best performance: a macro-accuracy, macro-precision, macro-recall, and macro-F1-score of 91.7, 86.2, 66.91, and 74.92%, respectively. CONCLUSION: This work shows how the CRISP-DM methodology can be used to create an open public health data repository. Furthermore, this methodology could be generalized to other types of problems requiring the creation of a dataset. In addition, the use of satellite images presents an alternative for places where data collection is challenging. The proposed model and methodology, based on open data, offer a low-cost and effective solution that decision-makers, especially in developing countries, could use to support food security planning. Frontiers Media S.A. 2022-01-27 /pmc/articles/PMC8828574/ /pubmed/35155518 http://dx.doi.org/10.3389/fnut.2021.796082 Text en Copyright © 2022 Restrepo, Pérez, López, Vargas-Cañas and Osorio-Valencia. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Nutrition Restrepo, David S. Pérez, Luis E. López, Diego M. Vargas-Cañas, Rubiel Osorio-Valencia, Juan Sebastian Multi-Dimensional Dataset of Open Data and Satellite Images for Characterization of Food Security and Nutrition |
title | Multi-Dimensional Dataset of Open Data and Satellite Images for Characterization of Food Security and Nutrition |
title_full | Multi-Dimensional Dataset of Open Data and Satellite Images for Characterization of Food Security and Nutrition |
title_fullStr | Multi-Dimensional Dataset of Open Data and Satellite Images for Characterization of Food Security and Nutrition |
title_full_unstemmed | Multi-Dimensional Dataset of Open Data and Satellite Images for Characterization of Food Security and Nutrition |
title_short | Multi-Dimensional Dataset of Open Data and Satellite Images for Characterization of Food Security and Nutrition |
title_sort | multi-dimensional dataset of open data and satellite images for characterization of food security and nutrition |
topic | Nutrition |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8828574/ https://www.ncbi.nlm.nih.gov/pubmed/35155518 http://dx.doi.org/10.3389/fnut.2021.796082 |
work_keys_str_mv | AT restrepodavids multidimensionaldatasetofopendataandsatelliteimagesforcharacterizationoffoodsecurityandnutrition AT perezluise multidimensionaldatasetofopendataandsatelliteimagesforcharacterizationoffoodsecurityandnutrition AT lopezdiegom multidimensionaldatasetofopendataandsatelliteimagesforcharacterizationoffoodsecurityandnutrition AT vargascanasrubiel multidimensionaldatasetofopendataandsatelliteimagesforcharacterizationoffoodsecurityandnutrition AT osoriovalenciajuansebastian multidimensionaldatasetofopendataandsatelliteimagesforcharacterizationoffoodsecurityandnutrition |
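
Note on the macro-averaged metrics quoted in the description: the record does not say how the macro-accuracy, macro-precision, macro-recall, and macro-F1-score were computed. A common convention is to compute each metric separately for every class and then take the unweighted mean. The Python sketch below illustrates that convention only. The `Counts` type, the helper names, and every number in `HYPOTHETICAL_COUNTS` are assumptions made for illustration; none of them come from the article or its dataset.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Counts:
    """Binary confusion counts for one class (hypothetical container)."""
    tp: int  # true positives
    fp: int  # false positives
    fn: int  # false negatives
    tn: int  # true negatives


def safe_div(num: float, den: float) -> float:
    """Return num / den, or 0.0 when the denominator is zero."""
    return num / den if den else 0.0


def per_class_metrics(c: Counts) -> dict:
    """Accuracy, precision, recall, and F1 for a single class."""
    accuracy = safe_div(c.tp + c.tn, c.tp + c.fp + c.fn + c.tn)
    precision = safe_div(c.tp, c.tp + c.fp)
    recall = safe_div(c.tp, c.tp + c.fn)
    f1 = safe_div(2 * precision * recall, precision + recall)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


def macro_average(counts_by_class: dict) -> dict:
    """Unweighted mean of each per-class metric (macro-averaging)."""
    per_class = [per_class_metrics(c) for c in counts_by_class.values()]
    return {
        name: sum(m[name] for m in per_class) / len(per_class)
        for name in ("accuracy", "precision", "recall", "f1")
    }


# Illustrative counts only: NOT results from the article.
HYPOTHETICAL_COUNTS = {
    "Agriculture": Counts(tp=80, fp=20, fn=20, tn=80),
    "Habitation": Counts(tp=50, fp=0, fn=50, tn=100),
    "Road": Counts(tp=30, fp=10, fn=30, tn=130),
    "Water": Counts(tp=40, fp=10, fn=10, tn=140),
}

if __name__ == "__main__":
    for name, value in macro_average(HYPOTHETICAL_COUNTS).items():
        print(f"macro-{name}: {100 * value:.2f}%")
```

Under this convention the macro-F1 is the mean of the per-class F1 scores, so it generally differs from the harmonic mean of the macro-precision and macro-recall. The harmonic mean of the reported 86.2% and 66.91% is about 75.3%, not 74.92%, which would be consistent with per-class averaging but does not confirm it.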