Cargando…

Model building and assessment of the impact of covariates for disease prevalence mapping in low-resource settings: to explain and to predict

This paper provides statistical guidance on the development and application of model-based geostatistical methods for disease prevalence mapping. We illustrate the different stages of the analysis, from exploratory analysis to spatial prediction of prevalence, through a case study on malaria mapping...

Descripción completa

Detalles Bibliográficos
Autores principales: Giorgi, Emanuele, Fronterrè, Claudio, Macharia, Peter M., Alegana, Victor A., Snow, Robert W., Diggle, Peter J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8169216/
https://www.ncbi.nlm.nih.gov/pubmed/34062104
http://dx.doi.org/10.1098/rsif.2021.0104
_version_ 1783702011855765504
author Giorgi, Emanuele
Fronterrè, Claudio
Macharia, Peter M.
Alegana, Victor A.
Snow, Robert W.
Diggle, Peter J.
author_facet Giorgi, Emanuele
Fronterrè, Claudio
Macharia, Peter M.
Alegana, Victor A.
Snow, Robert W.
Diggle, Peter J.
author_sort Giorgi, Emanuele
collection PubMed
description This paper provides statistical guidance on the development and application of model-based geostatistical methods for disease prevalence mapping. We illustrate the different stages of the analysis, from exploratory analysis to spatial prediction of prevalence, through a case study on malaria mapping in Tanzania. Throughout the paper, we distinguish between predictive modelling, whose main focus is on maximizing the predictive accuracy of the model, and explanatory modelling, where greater emphasis is placed on understanding the relationships between the health outcome and risk factors. We demonstrate that these two paradigms can result in different modelling choices. We also propose a simple approach for detecting over-fitting based on inspection of the correlation matrix of the estimators of the regression coefficients. To enhance the interpretability of geostatistical models, we introduce the concept of domain effects in order to assist variable selection and model validation. The statistical ideas and principles illustrated here in the specific context of disease prevalence mapping are more widely applicable to any regression model for the analysis of epidemiological outcomes but are particularly relevant to geostatistical models, for which the separation between fixed and random effects can be ambiguous.
format Online
Article
Text
id pubmed-8169216
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher The Royal Society
record_format MEDLINE/PubMed
spelling pubmed-81692162021-06-08 Model building and assessment of the impact of covariates for disease prevalence mapping in low-resource settings: to explain and to predict Giorgi, Emanuele Fronterrè, Claudio Macharia, Peter M. Alegana, Victor A. Snow, Robert W. Diggle, Peter J. J R Soc Interface Review Articles This paper provides statistical guidance on the development and application of model-based geostatistical methods for disease prevalence mapping. We illustrate the different stages of the analysis, from exploratory analysis to spatial prediction of prevalence, through a case study on malaria mapping in Tanzania. Throughout the paper, we distinguish between predictive modelling, whose main focus is on maximizing the predictive accuracy of the model, and explanatory modelling, where greater emphasis is placed on understanding the relationships between the health outcome and risk factors. We demonstrate that these two paradigms can result in different modelling choices. We also propose a simple approach for detecting over-fitting based on inspection of the correlation matrix of the estimators of the regression coefficients. To enhance the interpretability of geostatistical models, we introduce the concept of domain effects in order to assist variable selection and model validation. The statistical ideas and principles illustrated here in the specific context of disease prevalence mapping are more widely applicable to any regression model for the analysis of epidemiological outcomes but are particularly relevant to geostatistical models, for which the separation between fixed and random effects can be ambiguous. The Royal Society 2021-06-02 /pmc/articles/PMC8169216/ /pubmed/34062104 http://dx.doi.org/10.1098/rsif.2021.0104 Text en © 2021 The Authors. https://creativecommons.org/licenses/by/4.0/Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, provided the original author and source are credited.
spellingShingle Review Articles
Giorgi, Emanuele
Fronterrè, Claudio
Macharia, Peter M.
Alegana, Victor A.
Snow, Robert W.
Diggle, Peter J.
Model building and assessment of the impact of covariates for disease prevalence mapping in low-resource settings: to explain and to predict
title Model building and assessment of the impact of covariates for disease prevalence mapping in low-resource settings: to explain and to predict
title_full Model building and assessment of the impact of covariates for disease prevalence mapping in low-resource settings: to explain and to predict
title_fullStr Model building and assessment of the impact of covariates for disease prevalence mapping in low-resource settings: to explain and to predict
title_full_unstemmed Model building and assessment of the impact of covariates for disease prevalence mapping in low-resource settings: to explain and to predict
title_short Model building and assessment of the impact of covariates for disease prevalence mapping in low-resource settings: to explain and to predict
title_sort model building and assessment of the impact of covariates for disease prevalence mapping in low-resource settings: to explain and to predict
topic Review Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8169216/
https://www.ncbi.nlm.nih.gov/pubmed/34062104
http://dx.doi.org/10.1098/rsif.2021.0104
work_keys_str_mv AT giorgiemanuele modelbuildingandassessmentoftheimpactofcovariatesfordiseaseprevalencemappinginlowresourcesettingstoexplainandtopredict
AT fronterreclaudio modelbuildingandassessmentoftheimpactofcovariatesfordiseaseprevalencemappinginlowresourcesettingstoexplainandtopredict
AT machariapeterm modelbuildingandassessmentoftheimpactofcovariatesfordiseaseprevalencemappinginlowresourcesettingstoexplainandtopredict
AT aleganavictora modelbuildingandassessmentoftheimpactofcovariatesfordiseaseprevalencemappinginlowresourcesettingstoexplainandtopredict
AT snowrobertw modelbuildingandassessmentoftheimpactofcovariatesfordiseaseprevalencemappinginlowresourcesettingstoexplainandtopredict
AT digglepeterj modelbuildingandassessmentoftheimpactofcovariatesfordiseaseprevalencemappinginlowresourcesettingstoexplainandtopredict