Cargando…

Cholera Risk: A Machine Learning Approach Applied to Essential Climate Variables

Oceanic and coastal ecosystems have undergone complex environmental changes in recent years, amid a context of climate change. These changes are also reflected in the dynamics of water-borne diseases as some of the causative agents of these illnesses are ubiquitous in the aquatic environment and the...

Descripción completa

Detalles Bibliográficos
Autores principales: Campbell, Amy Marie, Racault, Marie-Fanny, Goult, Stephen, Laurenson, Angus
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7765326/
https://www.ncbi.nlm.nih.gov/pubmed/33333823
http://dx.doi.org/10.3390/ijerph17249378
_version_ 1783628465514217472
author Campbell, Amy Marie
Racault, Marie-Fanny
Goult, Stephen
Laurenson, Angus
author_facet Campbell, Amy Marie
Racault, Marie-Fanny
Goult, Stephen
Laurenson, Angus
author_sort Campbell, Amy Marie
collection PubMed
description Oceanic and coastal ecosystems have undergone complex environmental changes in recent years, amid a context of climate change. These changes are also reflected in the dynamics of water-borne diseases as some of the causative agents of these illnesses are ubiquitous in the aquatic environment and their survival rates are impacted by changes in climatic conditions. Previous studies have established strong relationships between essential climate variables and the coastal distribution and seasonal dynamics of the bacteria Vibrio cholerae, pathogenic types of which are responsible for human cholera disease. In this study we provide a novel exploration of the potential of a machine learning approach to forecast environmental cholera risk in coastal India, home to more than 200 million inhabitants, utilising atmospheric, terrestrial and oceanic satellite-derived essential climate variables. A Random Forest classifier model is developed, trained and tested on a cholera outbreak dataset over the period 2010–2018 for districts along coastal India. The random forest classifier model has an Accuracy of 0.99, an F1 Score of 0.942 and a Sensitivity score of 0.895, meaning that 89.5% of outbreaks are correctly identified. Spatio-temporal patterns emerged in terms of the model’s performance based on seasons and coastal locations. Further analysis of the specific contribution of each Essential Climate Variable to the model outputs shows that chlorophyll-a concentration, sea surface salinity and land surface temperature are the strongest predictors of the cholera outbreaks in the dataset used. The study reveals promising potential of the use of random forest classifiers and remotely-sensed essential climate variables for the development of environmental cholera-risk applications. Further exploration of the present random forest model and associated essential climate variables is encouraged on cholera surveillance datasets in other coastal areas affected by the disease to determine the model’s transferability potential and applicative value for cholera forecasting systems.
format Online
Article
Text
id pubmed-7765326
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-77653262020-12-27 Cholera Risk: A Machine Learning Approach Applied to Essential Climate Variables Campbell, Amy Marie Racault, Marie-Fanny Goult, Stephen Laurenson, Angus Int J Environ Res Public Health Article Oceanic and coastal ecosystems have undergone complex environmental changes in recent years, amid a context of climate change. These changes are also reflected in the dynamics of water-borne diseases as some of the causative agents of these illnesses are ubiquitous in the aquatic environment and their survival rates are impacted by changes in climatic conditions. Previous studies have established strong relationships between essential climate variables and the coastal distribution and seasonal dynamics of the bacteria Vibrio cholerae, pathogenic types of which are responsible for human cholera disease. In this study we provide a novel exploration of the potential of a machine learning approach to forecast environmental cholera risk in coastal India, home to more than 200 million inhabitants, utilising atmospheric, terrestrial and oceanic satellite-derived essential climate variables. A Random Forest classifier model is developed, trained and tested on a cholera outbreak dataset over the period 2010–2018 for districts along coastal India. The random forest classifier model has an Accuracy of 0.99, an F1 Score of 0.942 and a Sensitivity score of 0.895, meaning that 89.5% of outbreaks are correctly identified. Spatio-temporal patterns emerged in terms of the model’s performance based on seasons and coastal locations. Further analysis of the specific contribution of each Essential Climate Variable to the model outputs shows that chlorophyll-a concentration, sea surface salinity and land surface temperature are the strongest predictors of the cholera outbreaks in the dataset used. The study reveals promising potential of the use of random forest classifiers and remotely-sensed essential climate variables for the development of environmental cholera-risk applications. Further exploration of the present random forest model and associated essential climate variables is encouraged on cholera surveillance datasets in other coastal areas affected by the disease to determine the model’s transferability potential and applicative value for cholera forecasting systems. MDPI 2020-12-15 2020-12 /pmc/articles/PMC7765326/ /pubmed/33333823 http://dx.doi.org/10.3390/ijerph17249378 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Campbell, Amy Marie
Racault, Marie-Fanny
Goult, Stephen
Laurenson, Angus
Cholera Risk: A Machine Learning Approach Applied to Essential Climate Variables
title Cholera Risk: A Machine Learning Approach Applied to Essential Climate Variables
title_full Cholera Risk: A Machine Learning Approach Applied to Essential Climate Variables
title_fullStr Cholera Risk: A Machine Learning Approach Applied to Essential Climate Variables
title_full_unstemmed Cholera Risk: A Machine Learning Approach Applied to Essential Climate Variables
title_short Cholera Risk: A Machine Learning Approach Applied to Essential Climate Variables
title_sort cholera risk: a machine learning approach applied to essential climate variables
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7765326/
https://www.ncbi.nlm.nih.gov/pubmed/33333823
http://dx.doi.org/10.3390/ijerph17249378
work_keys_str_mv AT campbellamymarie cholerariskamachinelearningapproachappliedtoessentialclimatevariables
AT racaultmariefanny cholerariskamachinelearningapproachappliedtoessentialclimatevariables
AT goultstephen cholerariskamachinelearningapproachappliedtoessentialclimatevariables
AT laurensonangus cholerariskamachinelearningapproachappliedtoessentialclimatevariables