Cargando…

Improving disaggregation models of malaria incidence by ensembling non-linear models of prevalence

Maps of disease burden are a core tool needed for the control and elimination of malaria. Reliable routine surveillance data of malaria incidence, typically aggregated to administrative units, is becoming more widely available. Disaggregation regression is an important model framework for estimating...

Descripción completa

Detalles Bibliográficos
Autores principales: Lucas, Tim C.D., Nandi, Anita K., Keddie, Suzanne H., Chestnutt, Elisabeth G., Howes, Rosalind E., Rumisha, Susan F., Arambepola, Rohan, Bertozzi-Villa, Amelia, Python, Andre, Symons, Tasmin L., Millar, Justin J., Amratia, Punam, Hancock, Penelope, Battle, Katherine E., Cameron, Ewan, Gething, Peter W., Weiss, Daniel J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9205339/
https://www.ncbi.nlm.nih.gov/pubmed/35691633
http://dx.doi.org/10.1016/j.sste.2020.100357
_version_ 1784729102747435008
author Lucas, Tim C.D.
Nandi, Anita K.
Keddie, Suzanne H.
Chestnutt, Elisabeth G.
Howes, Rosalind E.
Rumisha, Susan F.
Arambepola, Rohan
Bertozzi-Villa, Amelia
Python, Andre
Symons, Tasmin L.
Millar, Justin J.
Amratia, Punam
Hancock, Penelope
Battle, Katherine E.
Cameron, Ewan
Gething, Peter W.
Weiss, Daniel J.
author_facet Lucas, Tim C.D.
Nandi, Anita K.
Keddie, Suzanne H.
Chestnutt, Elisabeth G.
Howes, Rosalind E.
Rumisha, Susan F.
Arambepola, Rohan
Bertozzi-Villa, Amelia
Python, Andre
Symons, Tasmin L.
Millar, Justin J.
Amratia, Punam
Hancock, Penelope
Battle, Katherine E.
Cameron, Ewan
Gething, Peter W.
Weiss, Daniel J.
author_sort Lucas, Tim C.D.
collection PubMed
description Maps of disease burden are a core tool needed for the control and elimination of malaria. Reliable routine surveillance data of malaria incidence, typically aggregated to administrative units, is becoming more widely available. Disaggregation regression is an important model framework for estimating high resolution risk maps from aggregated data. However, the aggregation of incidence over large, heterogeneous areas means that these data are underpowered for estimating complex, non-linear models. In contrast, prevalence point-surveys are directly linked to local environmental conditions but are not common in many areas of the world. Here, we train multiple non-linear, machine learning models on Plasmodium falciparum prevalence point-surveys. We then ensemble the predictions from these machine learning models with a disaggregation regression model that uses aggregated malaria incidences as response data. We find that using a disaggregation regression model to combine predictions from machine learning models improves model accuracy relative to a baseline model.
format Online
Article
Text
id pubmed-9205339
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-92053392022-06-24 Improving disaggregation models of malaria incidence by ensembling non-linear models of prevalence Lucas, Tim C.D. Nandi, Anita K. Keddie, Suzanne H. Chestnutt, Elisabeth G. Howes, Rosalind E. Rumisha, Susan F. Arambepola, Rohan Bertozzi-Villa, Amelia Python, Andre Symons, Tasmin L. Millar, Justin J. Amratia, Punam Hancock, Penelope Battle, Katherine E. Cameron, Ewan Gething, Peter W. Weiss, Daniel J. Spat Spatiotemporal Epidemiol Article Maps of disease burden are a core tool needed for the control and elimination of malaria. Reliable routine surveillance data of malaria incidence, typically aggregated to administrative units, is becoming more widely available. Disaggregation regression is an important model framework for estimating high resolution risk maps from aggregated data. However, the aggregation of incidence over large, heterogeneous areas means that these data are underpowered for estimating complex, non-linear models. In contrast, prevalence point-surveys are directly linked to local environmental conditions but are not common in many areas of the world. Here, we train multiple non-linear, machine learning models on Plasmodium falciparum prevalence point-surveys. We then ensemble the predictions from these machine learning models with a disaggregation regression model that uses aggregated malaria incidences as response data. We find that using a disaggregation regression model to combine predictions from machine learning models improves model accuracy relative to a baseline model. Elsevier 2022-06 /pmc/articles/PMC9205339/ /pubmed/35691633 http://dx.doi.org/10.1016/j.sste.2020.100357 Text en © 2020 The Author(s) https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Lucas, Tim C.D.
Nandi, Anita K.
Keddie, Suzanne H.
Chestnutt, Elisabeth G.
Howes, Rosalind E.
Rumisha, Susan F.
Arambepola, Rohan
Bertozzi-Villa, Amelia
Python, Andre
Symons, Tasmin L.
Millar, Justin J.
Amratia, Punam
Hancock, Penelope
Battle, Katherine E.
Cameron, Ewan
Gething, Peter W.
Weiss, Daniel J.
Improving disaggregation models of malaria incidence by ensembling non-linear models of prevalence
title Improving disaggregation models of malaria incidence by ensembling non-linear models of prevalence
title_full Improving disaggregation models of malaria incidence by ensembling non-linear models of prevalence
title_fullStr Improving disaggregation models of malaria incidence by ensembling non-linear models of prevalence
title_full_unstemmed Improving disaggregation models of malaria incidence by ensembling non-linear models of prevalence
title_short Improving disaggregation models of malaria incidence by ensembling non-linear models of prevalence
title_sort improving disaggregation models of malaria incidence by ensembling non-linear models of prevalence
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9205339/
https://www.ncbi.nlm.nih.gov/pubmed/35691633
http://dx.doi.org/10.1016/j.sste.2020.100357
work_keys_str_mv AT lucastimcd improvingdisaggregationmodelsofmalariaincidencebyensemblingnonlinearmodelsofprevalence
AT nandianitak improvingdisaggregationmodelsofmalariaincidencebyensemblingnonlinearmodelsofprevalence
AT keddiesuzanneh improvingdisaggregationmodelsofmalariaincidencebyensemblingnonlinearmodelsofprevalence
AT chestnuttelisabethg improvingdisaggregationmodelsofmalariaincidencebyensemblingnonlinearmodelsofprevalence
AT howesrosalinde improvingdisaggregationmodelsofmalariaincidencebyensemblingnonlinearmodelsofprevalence
AT rumishasusanf improvingdisaggregationmodelsofmalariaincidencebyensemblingnonlinearmodelsofprevalence
AT arambepolarohan improvingdisaggregationmodelsofmalariaincidencebyensemblingnonlinearmodelsofprevalence
AT bertozzivillaamelia improvingdisaggregationmodelsofmalariaincidencebyensemblingnonlinearmodelsofprevalence
AT pythonandre improvingdisaggregationmodelsofmalariaincidencebyensemblingnonlinearmodelsofprevalence
AT symonstasminl improvingdisaggregationmodelsofmalariaincidencebyensemblingnonlinearmodelsofprevalence
AT millarjustinj improvingdisaggregationmodelsofmalariaincidencebyensemblingnonlinearmodelsofprevalence
AT amratiapunam improvingdisaggregationmodelsofmalariaincidencebyensemblingnonlinearmodelsofprevalence
AT hancockpenelope improvingdisaggregationmodelsofmalariaincidencebyensemblingnonlinearmodelsofprevalence
AT battlekatherinee improvingdisaggregationmodelsofmalariaincidencebyensemblingnonlinearmodelsofprevalence
AT cameronewan improvingdisaggregationmodelsofmalariaincidencebyensemblingnonlinearmodelsofprevalence
AT gethingpeterw improvingdisaggregationmodelsofmalariaincidencebyensemblingnonlinearmodelsofprevalence
AT weissdanielj improvingdisaggregationmodelsofmalariaincidencebyensemblingnonlinearmodelsofprevalence