Cargando…
Model building and assessment of the impact of covariates for disease prevalence mapping in low-resource settings: to explain and to predict
This paper provides statistical guidance on the development and application of model-based geostatistical methods for disease prevalence mapping. We illustrate the different stages of the analysis, from exploratory analysis to spatial prediction of prevalence, through a case study on malaria mapping...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The Royal Society
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8169216/ https://www.ncbi.nlm.nih.gov/pubmed/34062104 http://dx.doi.org/10.1098/rsif.2021.0104 |
_version_ | 1783702011855765504 |
---|---|
author | Giorgi, Emanuele Fronterrè, Claudio Macharia, Peter M. Alegana, Victor A. Snow, Robert W. Diggle, Peter J. |
author_facet | Giorgi, Emanuele Fronterrè, Claudio Macharia, Peter M. Alegana, Victor A. Snow, Robert W. Diggle, Peter J. |
author_sort | Giorgi, Emanuele |
collection | PubMed |
description | This paper provides statistical guidance on the development and application of model-based geostatistical methods for disease prevalence mapping. We illustrate the different stages of the analysis, from exploratory analysis to spatial prediction of prevalence, through a case study on malaria mapping in Tanzania. Throughout the paper, we distinguish between predictive modelling, whose main focus is on maximizing the predictive accuracy of the model, and explanatory modelling, where greater emphasis is placed on understanding the relationships between the health outcome and risk factors. We demonstrate that these two paradigms can result in different modelling choices. We also propose a simple approach for detecting over-fitting based on inspection of the correlation matrix of the estimators of the regression coefficients. To enhance the interpretability of geostatistical models, we introduce the concept of domain effects in order to assist variable selection and model validation. The statistical ideas and principles illustrated here in the specific context of disease prevalence mapping are more widely applicable to any regression model for the analysis of epidemiological outcomes but are particularly relevant to geostatistical models, for which the separation between fixed and random effects can be ambiguous. |
format | Online Article Text |
id | pubmed-8169216 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | The Royal Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-81692162021-06-08 Model building and assessment of the impact of covariates for disease prevalence mapping in low-resource settings: to explain and to predict Giorgi, Emanuele Fronterrè, Claudio Macharia, Peter M. Alegana, Victor A. Snow, Robert W. Diggle, Peter J. J R Soc Interface Review Articles This paper provides statistical guidance on the development and application of model-based geostatistical methods for disease prevalence mapping. We illustrate the different stages of the analysis, from exploratory analysis to spatial prediction of prevalence, through a case study on malaria mapping in Tanzania. Throughout the paper, we distinguish between predictive modelling, whose main focus is on maximizing the predictive accuracy of the model, and explanatory modelling, where greater emphasis is placed on understanding the relationships between the health outcome and risk factors. We demonstrate that these two paradigms can result in different modelling choices. We also propose a simple approach for detecting over-fitting based on inspection of the correlation matrix of the estimators of the regression coefficients. To enhance the interpretability of geostatistical models, we introduce the concept of domain effects in order to assist variable selection and model validation. The statistical ideas and principles illustrated here in the specific context of disease prevalence mapping are more widely applicable to any regression model for the analysis of epidemiological outcomes but are particularly relevant to geostatistical models, for which the separation between fixed and random effects can be ambiguous. The Royal Society 2021-06-02 /pmc/articles/PMC8169216/ /pubmed/34062104 http://dx.doi.org/10.1098/rsif.2021.0104 Text en © 2021 The Authors. https://creativecommons.org/licenses/by/4.0/Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, provided the original author and source are credited. |
spellingShingle | Review Articles Giorgi, Emanuele Fronterrè, Claudio Macharia, Peter M. Alegana, Victor A. Snow, Robert W. Diggle, Peter J. Model building and assessment of the impact of covariates for disease prevalence mapping in low-resource settings: to explain and to predict |
title | Model building and assessment of the impact of covariates for disease prevalence mapping in low-resource settings: to explain and to predict |
title_full | Model building and assessment of the impact of covariates for disease prevalence mapping in low-resource settings: to explain and to predict |
title_fullStr | Model building and assessment of the impact of covariates for disease prevalence mapping in low-resource settings: to explain and to predict |
title_full_unstemmed | Model building and assessment of the impact of covariates for disease prevalence mapping in low-resource settings: to explain and to predict |
title_short | Model building and assessment of the impact of covariates for disease prevalence mapping in low-resource settings: to explain and to predict |
title_sort | model building and assessment of the impact of covariates for disease prevalence mapping in low-resource settings: to explain and to predict |
topic | Review Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8169216/ https://www.ncbi.nlm.nih.gov/pubmed/34062104 http://dx.doi.org/10.1098/rsif.2021.0104 |
work_keys_str_mv | AT giorgiemanuele modelbuildingandassessmentoftheimpactofcovariatesfordiseaseprevalencemappinginlowresourcesettingstoexplainandtopredict AT fronterreclaudio modelbuildingandassessmentoftheimpactofcovariatesfordiseaseprevalencemappinginlowresourcesettingstoexplainandtopredict AT machariapeterm modelbuildingandassessmentoftheimpactofcovariatesfordiseaseprevalencemappinginlowresourcesettingstoexplainandtopredict AT aleganavictora modelbuildingandassessmentoftheimpactofcovariatesfordiseaseprevalencemappinginlowresourcesettingstoexplainandtopredict AT snowrobertw modelbuildingandassessmentoftheimpactofcovariatesfordiseaseprevalencemappinginlowresourcesettingstoexplainandtopredict AT digglepeterj modelbuildingandassessmentoftheimpactofcovariatesfordiseaseprevalencemappinginlowresourcesettingstoexplainandtopredict |