Cargando…

Enhancing water access monitoring through mapping multi-source usage and disaggregated geographic inequalities with machine learning and surveys

Monitoring safe water access in developing countries relies primarily on household health survey and census data. These surveys are often incomplete: they tend to focus on the primary water source only, are spatially coarse, and usually happen every 5-10 years, during which significant changes can h...

Descripción completa

Detalles Bibliográficos
Autores principales: Geleijnse, Jan, Rutten, Martine, de Villiers, Didier, Bamwenda, James Tayebwa, Abraham, Edo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10439218/
https://www.ncbi.nlm.nih.gov/pubmed/37596313
http://dx.doi.org/10.1038/s41598-023-39917-6
_version_ 1785092898586361856
author Geleijnse, Jan
Rutten, Martine
de Villiers, Didier
Bamwenda, James Tayebwa
Abraham, Edo
author_facet Geleijnse, Jan
Rutten, Martine
de Villiers, Didier
Bamwenda, James Tayebwa
Abraham, Edo
author_sort Geleijnse, Jan
collection PubMed
description Monitoring safe water access in developing countries relies primarily on household health survey and census data. These surveys are often incomplete: they tend to focus on the primary water source only, are spatially coarse, and usually happen every 5-10 years, during which significant changes can happen in urbanisation and infrastructure provision, especially in sub Saharan Africa. In this work, we present a data-driven approach that utilises and compliments survey based data of water access, to provide context-specific and disaggregated monitoring. The level of access to improved water and sanitation has been shown to vary with geographical inequalities related to the availability of water resources and terrain, population density and socio-economic determinants such as income and education. We use such data and successfully predict the level of water access in areas for which data is lacking, providing spatially explicit and community level monitoring possibilities for mapping geographical inequalities in access. This is showcased by applying three machine learning models that use such geographical data to predict the number of presences of water access points of eight different access types across Uganda, with a 1km by 1km grid resolution. Two Multi-Layer-Perceptron (MLP) models and a Maximum Entropy (MaxEnt) model are developed and compared, where the former are shown to consistently outperform the latter. The best performing Neural Network model achieved a True Positive Rate of 0.89 and a False Positive Rate of 0.24, compared to 0.85 and 0.46 respectively for the MaxEnt model. The models improve on previous work on water point modeling through the use of neural networks, in addition to introducing the True Positive - and False Positive Rate as better evaluation metrics to also assess the MaxEnt model. We also present a scaling method to move from predicting only the relative probability of water point presences, to predicting the absolute number of presences. To challenge both the model results and the more standard health surveys, a new household level survey is carried out in Bushenyi, a mid-sized town in the South-West of Uganda, asking specifically about the multitude of water sources. On average Bushenyi households reported to use 1.9 water sources. The survey further showed that the actual presence of a source, does not always imply that it is used. Therefore it is no option to rely solely on models for water access monitoring. For this, household surveys remain necessary but should be extended with questions on the multiple sources that are used by households.
format Online
Article
Text
id pubmed-10439218
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-104392182023-08-20 Enhancing water access monitoring through mapping multi-source usage and disaggregated geographic inequalities with machine learning and surveys Geleijnse, Jan Rutten, Martine de Villiers, Didier Bamwenda, James Tayebwa Abraham, Edo Sci Rep Article Monitoring safe water access in developing countries relies primarily on household health survey and census data. These surveys are often incomplete: they tend to focus on the primary water source only, are spatially coarse, and usually happen every 5-10 years, during which significant changes can happen in urbanisation and infrastructure provision, especially in sub Saharan Africa. In this work, we present a data-driven approach that utilises and compliments survey based data of water access, to provide context-specific and disaggregated monitoring. The level of access to improved water and sanitation has been shown to vary with geographical inequalities related to the availability of water resources and terrain, population density and socio-economic determinants such as income and education. We use such data and successfully predict the level of water access in areas for which data is lacking, providing spatially explicit and community level monitoring possibilities for mapping geographical inequalities in access. This is showcased by applying three machine learning models that use such geographical data to predict the number of presences of water access points of eight different access types across Uganda, with a 1km by 1km grid resolution. Two Multi-Layer-Perceptron (MLP) models and a Maximum Entropy (MaxEnt) model are developed and compared, where the former are shown to consistently outperform the latter. The best performing Neural Network model achieved a True Positive Rate of 0.89 and a False Positive Rate of 0.24, compared to 0.85 and 0.46 respectively for the MaxEnt model. The models improve on previous work on water point modeling through the use of neural networks, in addition to introducing the True Positive - and False Positive Rate as better evaluation metrics to also assess the MaxEnt model. We also present a scaling method to move from predicting only the relative probability of water point presences, to predicting the absolute number of presences. To challenge both the model results and the more standard health surveys, a new household level survey is carried out in Bushenyi, a mid-sized town in the South-West of Uganda, asking specifically about the multitude of water sources. On average Bushenyi households reported to use 1.9 water sources. The survey further showed that the actual presence of a source, does not always imply that it is used. Therefore it is no option to rely solely on models for water access monitoring. For this, household surveys remain necessary but should be extended with questions on the multiple sources that are used by households. Nature Publishing Group UK 2023-08-18 /pmc/articles/PMC10439218/ /pubmed/37596313 http://dx.doi.org/10.1038/s41598-023-39917-6 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Geleijnse, Jan
Rutten, Martine
de Villiers, Didier
Bamwenda, James Tayebwa
Abraham, Edo
Enhancing water access monitoring through mapping multi-source usage and disaggregated geographic inequalities with machine learning and surveys
title Enhancing water access monitoring through mapping multi-source usage and disaggregated geographic inequalities with machine learning and surveys
title_full Enhancing water access monitoring through mapping multi-source usage and disaggregated geographic inequalities with machine learning and surveys
title_fullStr Enhancing water access monitoring through mapping multi-source usage and disaggregated geographic inequalities with machine learning and surveys
title_full_unstemmed Enhancing water access monitoring through mapping multi-source usage and disaggregated geographic inequalities with machine learning and surveys
title_short Enhancing water access monitoring through mapping multi-source usage and disaggregated geographic inequalities with machine learning and surveys
title_sort enhancing water access monitoring through mapping multi-source usage and disaggregated geographic inequalities with machine learning and surveys
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10439218/
https://www.ncbi.nlm.nih.gov/pubmed/37596313
http://dx.doi.org/10.1038/s41598-023-39917-6
work_keys_str_mv AT geleijnsejan enhancingwateraccessmonitoringthroughmappingmultisourceusageanddisaggregatedgeographicinequalitieswithmachinelearningandsurveys
AT ruttenmartine enhancingwateraccessmonitoringthroughmappingmultisourceusageanddisaggregatedgeographicinequalitieswithmachinelearningandsurveys
AT devilliersdidier enhancingwateraccessmonitoringthroughmappingmultisourceusageanddisaggregatedgeographicinequalitieswithmachinelearningandsurveys
AT bamwendajamestayebwa enhancingwateraccessmonitoringthroughmappingmultisourceusageanddisaggregatedgeographicinequalitieswithmachinelearningandsurveys
AT abrahamedo enhancingwateraccessmonitoringthroughmappingmultisourceusageanddisaggregatedgeographicinequalitieswithmachinelearningandsurveys