Cargando…
Improving disaggregation models of malaria incidence by ensembling non-linear models of prevalence
Maps of disease burden are a core tool needed for the control and elimination of malaria. Reliable routine surveillance data of malaria incidence, typically aggregated to administrative units, is becoming more widely available. Disaggregation regression is an important model framework for estimating...
Autores principales: | , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9205339/ https://www.ncbi.nlm.nih.gov/pubmed/35691633 http://dx.doi.org/10.1016/j.sste.2020.100357 |
_version_ | 1784729102747435008 |
---|---|
author | Lucas, Tim C.D. Nandi, Anita K. Keddie, Suzanne H. Chestnutt, Elisabeth G. Howes, Rosalind E. Rumisha, Susan F. Arambepola, Rohan Bertozzi-Villa, Amelia Python, Andre Symons, Tasmin L. Millar, Justin J. Amratia, Punam Hancock, Penelope Battle, Katherine E. Cameron, Ewan Gething, Peter W. Weiss, Daniel J. |
author_facet | Lucas, Tim C.D. Nandi, Anita K. Keddie, Suzanne H. Chestnutt, Elisabeth G. Howes, Rosalind E. Rumisha, Susan F. Arambepola, Rohan Bertozzi-Villa, Amelia Python, Andre Symons, Tasmin L. Millar, Justin J. Amratia, Punam Hancock, Penelope Battle, Katherine E. Cameron, Ewan Gething, Peter W. Weiss, Daniel J. |
author_sort | Lucas, Tim C.D. |
collection | PubMed |
description | Maps of disease burden are a core tool needed for the control and elimination of malaria. Reliable routine surveillance data of malaria incidence, typically aggregated to administrative units, is becoming more widely available. Disaggregation regression is an important model framework for estimating high resolution risk maps from aggregated data. However, the aggregation of incidence over large, heterogeneous areas means that these data are underpowered for estimating complex, non-linear models. In contrast, prevalence point-surveys are directly linked to local environmental conditions but are not common in many areas of the world. Here, we train multiple non-linear, machine learning models on Plasmodium falciparum prevalence point-surveys. We then ensemble the predictions from these machine learning models with a disaggregation regression model that uses aggregated malaria incidences as response data. We find that using a disaggregation regression model to combine predictions from machine learning models improves model accuracy relative to a baseline model. |
format | Online Article Text |
id | pubmed-9205339 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-92053392022-06-24 Improving disaggregation models of malaria incidence by ensembling non-linear models of prevalence Lucas, Tim C.D. Nandi, Anita K. Keddie, Suzanne H. Chestnutt, Elisabeth G. Howes, Rosalind E. Rumisha, Susan F. Arambepola, Rohan Bertozzi-Villa, Amelia Python, Andre Symons, Tasmin L. Millar, Justin J. Amratia, Punam Hancock, Penelope Battle, Katherine E. Cameron, Ewan Gething, Peter W. Weiss, Daniel J. Spat Spatiotemporal Epidemiol Article Maps of disease burden are a core tool needed for the control and elimination of malaria. Reliable routine surveillance data of malaria incidence, typically aggregated to administrative units, is becoming more widely available. Disaggregation regression is an important model framework for estimating high resolution risk maps from aggregated data. However, the aggregation of incidence over large, heterogeneous areas means that these data are underpowered for estimating complex, non-linear models. In contrast, prevalence point-surveys are directly linked to local environmental conditions but are not common in many areas of the world. Here, we train multiple non-linear, machine learning models on Plasmodium falciparum prevalence point-surveys. We then ensemble the predictions from these machine learning models with a disaggregation regression model that uses aggregated malaria incidences as response data. We find that using a disaggregation regression model to combine predictions from machine learning models improves model accuracy relative to a baseline model. Elsevier 2022-06 /pmc/articles/PMC9205339/ /pubmed/35691633 http://dx.doi.org/10.1016/j.sste.2020.100357 Text en © 2020 The Author(s) https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Lucas, Tim C.D. Nandi, Anita K. Keddie, Suzanne H. Chestnutt, Elisabeth G. Howes, Rosalind E. Rumisha, Susan F. Arambepola, Rohan Bertozzi-Villa, Amelia Python, Andre Symons, Tasmin L. Millar, Justin J. Amratia, Punam Hancock, Penelope Battle, Katherine E. Cameron, Ewan Gething, Peter W. Weiss, Daniel J. Improving disaggregation models of malaria incidence by ensembling non-linear models of prevalence |
title | Improving disaggregation models of malaria incidence by ensembling non-linear models of prevalence |
title_full | Improving disaggregation models of malaria incidence by ensembling non-linear models of prevalence |
title_fullStr | Improving disaggregation models of malaria incidence by ensembling non-linear models of prevalence |
title_full_unstemmed | Improving disaggregation models of malaria incidence by ensembling non-linear models of prevalence |
title_short | Improving disaggregation models of malaria incidence by ensembling non-linear models of prevalence |
title_sort | improving disaggregation models of malaria incidence by ensembling non-linear models of prevalence |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9205339/ https://www.ncbi.nlm.nih.gov/pubmed/35691633 http://dx.doi.org/10.1016/j.sste.2020.100357 |
work_keys_str_mv | AT lucastimcd improvingdisaggregationmodelsofmalariaincidencebyensemblingnonlinearmodelsofprevalence AT nandianitak improvingdisaggregationmodelsofmalariaincidencebyensemblingnonlinearmodelsofprevalence AT keddiesuzanneh improvingdisaggregationmodelsofmalariaincidencebyensemblingnonlinearmodelsofprevalence AT chestnuttelisabethg improvingdisaggregationmodelsofmalariaincidencebyensemblingnonlinearmodelsofprevalence AT howesrosalinde improvingdisaggregationmodelsofmalariaincidencebyensemblingnonlinearmodelsofprevalence AT rumishasusanf improvingdisaggregationmodelsofmalariaincidencebyensemblingnonlinearmodelsofprevalence AT arambepolarohan improvingdisaggregationmodelsofmalariaincidencebyensemblingnonlinearmodelsofprevalence AT bertozzivillaamelia improvingdisaggregationmodelsofmalariaincidencebyensemblingnonlinearmodelsofprevalence AT pythonandre improvingdisaggregationmodelsofmalariaincidencebyensemblingnonlinearmodelsofprevalence AT symonstasminl improvingdisaggregationmodelsofmalariaincidencebyensemblingnonlinearmodelsofprevalence AT millarjustinj improvingdisaggregationmodelsofmalariaincidencebyensemblingnonlinearmodelsofprevalence AT amratiapunam improvingdisaggregationmodelsofmalariaincidencebyensemblingnonlinearmodelsofprevalence AT hancockpenelope improvingdisaggregationmodelsofmalariaincidencebyensemblingnonlinearmodelsofprevalence AT battlekatherinee improvingdisaggregationmodelsofmalariaincidencebyensemblingnonlinearmodelsofprevalence AT cameronewan improvingdisaggregationmodelsofmalariaincidencebyensemblingnonlinearmodelsofprevalence AT gethingpeterw improvingdisaggregationmodelsofmalariaincidencebyensemblingnonlinearmodelsofprevalence AT weissdanielj improvingdisaggregationmodelsofmalariaincidencebyensemblingnonlinearmodelsofprevalence |