Cargando…
Enhancing water access monitoring through mapping multi-source usage and disaggregated geographic inequalities with machine learning and surveys
Monitoring safe water access in developing countries relies primarily on household health survey and census data. These surveys are often incomplete: they tend to focus on the primary water source only, are spatially coarse, and usually happen every 5-10 years, during which significant changes can h...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10439218/ https://www.ncbi.nlm.nih.gov/pubmed/37596313 http://dx.doi.org/10.1038/s41598-023-39917-6 |
_version_ | 1785092898586361856 |
---|---|
author | Geleijnse, Jan Rutten, Martine de Villiers, Didier Bamwenda, James Tayebwa Abraham, Edo |
author_facet | Geleijnse, Jan Rutten, Martine de Villiers, Didier Bamwenda, James Tayebwa Abraham, Edo |
author_sort | Geleijnse, Jan |
collection | PubMed |
description | Monitoring safe water access in developing countries relies primarily on household health survey and census data. These surveys are often incomplete: they tend to focus on the primary water source only, are spatially coarse, and usually happen every 5-10 years, during which significant changes can happen in urbanisation and infrastructure provision, especially in sub Saharan Africa. In this work, we present a data-driven approach that utilises and compliments survey based data of water access, to provide context-specific and disaggregated monitoring. The level of access to improved water and sanitation has been shown to vary with geographical inequalities related to the availability of water resources and terrain, population density and socio-economic determinants such as income and education. We use such data and successfully predict the level of water access in areas for which data is lacking, providing spatially explicit and community level monitoring possibilities for mapping geographical inequalities in access. This is showcased by applying three machine learning models that use such geographical data to predict the number of presences of water access points of eight different access types across Uganda, with a 1km by 1km grid resolution. Two Multi-Layer-Perceptron (MLP) models and a Maximum Entropy (MaxEnt) model are developed and compared, where the former are shown to consistently outperform the latter. The best performing Neural Network model achieved a True Positive Rate of 0.89 and a False Positive Rate of 0.24, compared to 0.85 and 0.46 respectively for the MaxEnt model. The models improve on previous work on water point modeling through the use of neural networks, in addition to introducing the True Positive - and False Positive Rate as better evaluation metrics to also assess the MaxEnt model. We also present a scaling method to move from predicting only the relative probability of water point presences, to predicting the absolute number of presences. To challenge both the model results and the more standard health surveys, a new household level survey is carried out in Bushenyi, a mid-sized town in the South-West of Uganda, asking specifically about the multitude of water sources. On average Bushenyi households reported to use 1.9 water sources. The survey further showed that the actual presence of a source, does not always imply that it is used. Therefore it is no option to rely solely on models for water access monitoring. For this, household surveys remain necessary but should be extended with questions on the multiple sources that are used by households. |
format | Online Article Text |
id | pubmed-10439218 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-104392182023-08-20 Enhancing water access monitoring through mapping multi-source usage and disaggregated geographic inequalities with machine learning and surveys Geleijnse, Jan Rutten, Martine de Villiers, Didier Bamwenda, James Tayebwa Abraham, Edo Sci Rep Article Monitoring safe water access in developing countries relies primarily on household health survey and census data. These surveys are often incomplete: they tend to focus on the primary water source only, are spatially coarse, and usually happen every 5-10 years, during which significant changes can happen in urbanisation and infrastructure provision, especially in sub Saharan Africa. In this work, we present a data-driven approach that utilises and compliments survey based data of water access, to provide context-specific and disaggregated monitoring. The level of access to improved water and sanitation has been shown to vary with geographical inequalities related to the availability of water resources and terrain, population density and socio-economic determinants such as income and education. We use such data and successfully predict the level of water access in areas for which data is lacking, providing spatially explicit and community level monitoring possibilities for mapping geographical inequalities in access. This is showcased by applying three machine learning models that use such geographical data to predict the number of presences of water access points of eight different access types across Uganda, with a 1km by 1km grid resolution. Two Multi-Layer-Perceptron (MLP) models and a Maximum Entropy (MaxEnt) model are developed and compared, where the former are shown to consistently outperform the latter. The best performing Neural Network model achieved a True Positive Rate of 0.89 and a False Positive Rate of 0.24, compared to 0.85 and 0.46 respectively for the MaxEnt model. The models improve on previous work on water point modeling through the use of neural networks, in addition to introducing the True Positive - and False Positive Rate as better evaluation metrics to also assess the MaxEnt model. We also present a scaling method to move from predicting only the relative probability of water point presences, to predicting the absolute number of presences. To challenge both the model results and the more standard health surveys, a new household level survey is carried out in Bushenyi, a mid-sized town in the South-West of Uganda, asking specifically about the multitude of water sources. On average Bushenyi households reported to use 1.9 water sources. The survey further showed that the actual presence of a source, does not always imply that it is used. Therefore it is no option to rely solely on models for water access monitoring. For this, household surveys remain necessary but should be extended with questions on the multiple sources that are used by households. Nature Publishing Group UK 2023-08-18 /pmc/articles/PMC10439218/ /pubmed/37596313 http://dx.doi.org/10.1038/s41598-023-39917-6 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Geleijnse, Jan Rutten, Martine de Villiers, Didier Bamwenda, James Tayebwa Abraham, Edo Enhancing water access monitoring through mapping multi-source usage and disaggregated geographic inequalities with machine learning and surveys |
title | Enhancing water access monitoring through mapping multi-source usage and disaggregated geographic inequalities with machine learning and surveys |
title_full | Enhancing water access monitoring through mapping multi-source usage and disaggregated geographic inequalities with machine learning and surveys |
title_fullStr | Enhancing water access monitoring through mapping multi-source usage and disaggregated geographic inequalities with machine learning and surveys |
title_full_unstemmed | Enhancing water access monitoring through mapping multi-source usage and disaggregated geographic inequalities with machine learning and surveys |
title_short | Enhancing water access monitoring through mapping multi-source usage and disaggregated geographic inequalities with machine learning and surveys |
title_sort | enhancing water access monitoring through mapping multi-source usage and disaggregated geographic inequalities with machine learning and surveys |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10439218/ https://www.ncbi.nlm.nih.gov/pubmed/37596313 http://dx.doi.org/10.1038/s41598-023-39917-6 |
work_keys_str_mv | AT geleijnsejan enhancingwateraccessmonitoringthroughmappingmultisourceusageanddisaggregatedgeographicinequalitieswithmachinelearningandsurveys AT ruttenmartine enhancingwateraccessmonitoringthroughmappingmultisourceusageanddisaggregatedgeographicinequalitieswithmachinelearningandsurveys AT devilliersdidier enhancingwateraccessmonitoringthroughmappingmultisourceusageanddisaggregatedgeographicinequalitieswithmachinelearningandsurveys AT bamwendajamestayebwa enhancingwateraccessmonitoringthroughmappingmultisourceusageanddisaggregatedgeographicinequalitieswithmachinelearningandsurveys AT abrahamedo enhancingwateraccessmonitoringthroughmappingmultisourceusageanddisaggregatedgeographicinequalitieswithmachinelearningandsurveys |