Cargando…
Estimating range of influence in case of missing spatial data: a simulation study on binary data
BACKGROUND: The range of influence refers to the average distance between locations at which the observed outcome is no longer correlated. In many studies, missing data occur and a popular tool for handling missing data is multiple imputation. The objective of this study was to investigate how the e...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4325952/ https://www.ncbi.nlm.nih.gov/pubmed/25563056 http://dx.doi.org/10.1186/1476-072X-14-1 |
_version_ | 1782356873699131392 |
---|---|
author | Bihrmann, Kristine Ersbøll, Annette K |
author_facet | Bihrmann, Kristine Ersbøll, Annette K |
author_sort | Bihrmann, Kristine |
collection | PubMed |
description | BACKGROUND: The range of influence refers to the average distance between locations at which the observed outcome is no longer correlated. In many studies, missing data occur and a popular tool for handling missing data is multiple imputation. The objective of this study was to investigate how the estimated range of influence is affected when 1) the outcome is only observed at some of a given set of locations, and 2) multiple imputation is used to impute the outcome at the non-observed locations. METHODS: The study was based on the simulation of missing outcomes in a complete data set. The range of influence was estimated from a logistic regression model with a spatially structured random effect, modelled by a Gaussian field. Results were evaluated by comparing estimates obtained from complete, missing, and imputed data. RESULTS: In most simulation scenarios, the range estimates were consistent with ≤25% missing data. In some scenarios, however, the range estimate was affected by even a moderate number of missing observations. Multiple imputation provided a potential improvement in the range estimate with ≥50% missing data, but also increased the uncertainty of the estimate. CONCLUSIONS: The effect of missing observations on the estimated range of influence depended to some extent on the missing data mechanism. In general, the overall effect of missing observations was small compared to the uncertainty of the range estimate. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1476-072X-14-1) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4325952 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-43259522015-02-13 Estimating range of influence in case of missing spatial data: a simulation study on binary data Bihrmann, Kristine Ersbøll, Annette K Int J Health Geogr Methodology BACKGROUND: The range of influence refers to the average distance between locations at which the observed outcome is no longer correlated. In many studies, missing data occur and a popular tool for handling missing data is multiple imputation. The objective of this study was to investigate how the estimated range of influence is affected when 1) the outcome is only observed at some of a given set of locations, and 2) multiple imputation is used to impute the outcome at the non-observed locations. METHODS: The study was based on the simulation of missing outcomes in a complete data set. The range of influence was estimated from a logistic regression model with a spatially structured random effect, modelled by a Gaussian field. Results were evaluated by comparing estimates obtained from complete, missing, and imputed data. RESULTS: In most simulation scenarios, the range estimates were consistent with ≤25% missing data. In some scenarios, however, the range estimate was affected by even a moderate number of missing observations. Multiple imputation provided a potential improvement in the range estimate with ≥50% missing data, but also increased the uncertainty of the estimate. CONCLUSIONS: The effect of missing observations on the estimated range of influence depended to some extent on the missing data mechanism. In general, the overall effect of missing observations was small compared to the uncertainty of the range estimate. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1476-072X-14-1) contains supplementary material, which is available to authorized users. BioMed Central 2015-01-06 /pmc/articles/PMC4325952/ /pubmed/25563056 http://dx.doi.org/10.1186/1476-072X-14-1 Text en © Bihrmann and Ersbøll; licensee BioMed Central. 2015 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Bihrmann, Kristine Ersbøll, Annette K Estimating range of influence in case of missing spatial data: a simulation study on binary data |
title | Estimating range of influence in case of missing spatial data: a simulation study on binary data |
title_full | Estimating range of influence in case of missing spatial data: a simulation study on binary data |
title_fullStr | Estimating range of influence in case of missing spatial data: a simulation study on binary data |
title_full_unstemmed | Estimating range of influence in case of missing spatial data: a simulation study on binary data |
title_short | Estimating range of influence in case of missing spatial data: a simulation study on binary data |
title_sort | estimating range of influence in case of missing spatial data: a simulation study on binary data |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4325952/ https://www.ncbi.nlm.nih.gov/pubmed/25563056 http://dx.doi.org/10.1186/1476-072X-14-1 |
work_keys_str_mv | AT bihrmannkristine estimatingrangeofinfluenceincaseofmissingspatialdataasimulationstudyonbinarydata AT ersbøllannettek estimatingrangeofinfluenceincaseofmissingspatialdataasimulationstudyonbinarydata |