Cargando…

Estimating the accuracy of geographical imputation

BACKGROUND: To reduce the number of non-geocoded cases researchers and organizations sometimes include cases geocoded to postal code centroids along with cases geocoded with the greater precision of a full street address. Some analysts then use the postal code to assign information to the cases from...

Descripción completa

Detalles Bibliográficos
Autores principales: Henry, Kevin A, Boscoe, Francis P
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2266732/
https://www.ncbi.nlm.nih.gov/pubmed/18215308
http://dx.doi.org/10.1186/1476-072X-7-3
_version_ 1782151549654401024
author Henry, Kevin A
Boscoe, Francis P
author_facet Henry, Kevin A
Boscoe, Francis P
author_sort Henry, Kevin A
collection PubMed
description BACKGROUND: To reduce the number of non-geocoded cases researchers and organizations sometimes include cases geocoded to postal code centroids along with cases geocoded with the greater precision of a full street address. Some analysts then use the postal code to assign information to the cases from finer-level geographies such as a census tract. Assignment is commonly completed using either a postal centroid or by a geographical imputation method which assigns a location by using both the demographic characteristics of the case and the population characteristics of the postal delivery area. To date no systematic evaluation of geographical imputation methods ("geo-imputation") has been completed. The objective of this study was to determine the accuracy of census tract assignment using geo-imputation. METHODS: Using a large dataset of breast, prostate and colorectal cancer cases reported to the New Jersey Cancer Registry, we determined how often cases were assigned to the correct census tract using alternate strategies of demographic based geo-imputation, and using assignments obtained from postal code centroids. Assignment accuracy was measured by comparing the tract assigned with the tract originally identified from the full street address. RESULTS: Assigning cases to census tracts using the race/ethnicity population distribution within a postal code resulted in more correctly assigned cases than when using postal code centroids. The addition of age characteristics increased the match rates even further. Match rates were highly dependent on both the geographic distribution of race/ethnicity groups and population density. CONCLUSION: Geo-imputation appears to offer some advantages and no serious drawbacks as compared with the alternative of assigning cases to census tracts based on postal code centroids. For a specific analysis, researchers will still need to consider the potential impact of geocoding quality on their results and evaluate the possibility that it might introduce geographical bias.
format Text
id pubmed-2266732
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-22667322008-03-11 Estimating the accuracy of geographical imputation Henry, Kevin A Boscoe, Francis P Int J Health Geogr Methodology BACKGROUND: To reduce the number of non-geocoded cases researchers and organizations sometimes include cases geocoded to postal code centroids along with cases geocoded with the greater precision of a full street address. Some analysts then use the postal code to assign information to the cases from finer-level geographies such as a census tract. Assignment is commonly completed using either a postal centroid or by a geographical imputation method which assigns a location by using both the demographic characteristics of the case and the population characteristics of the postal delivery area. To date no systematic evaluation of geographical imputation methods ("geo-imputation") has been completed. The objective of this study was to determine the accuracy of census tract assignment using geo-imputation. METHODS: Using a large dataset of breast, prostate and colorectal cancer cases reported to the New Jersey Cancer Registry, we determined how often cases were assigned to the correct census tract using alternate strategies of demographic based geo-imputation, and using assignments obtained from postal code centroids. Assignment accuracy was measured by comparing the tract assigned with the tract originally identified from the full street address. RESULTS: Assigning cases to census tracts using the race/ethnicity population distribution within a postal code resulted in more correctly assigned cases than when using postal code centroids. The addition of age characteristics increased the match rates even further. Match rates were highly dependent on both the geographic distribution of race/ethnicity groups and population density. CONCLUSION: Geo-imputation appears to offer some advantages and no serious drawbacks as compared with the alternative of assigning cases to census tracts based on postal code centroids. For a specific analysis, researchers will still need to consider the potential impact of geocoding quality on their results and evaluate the possibility that it might introduce geographical bias. BioMed Central 2008-01-23 /pmc/articles/PMC2266732/ /pubmed/18215308 http://dx.doi.org/10.1186/1476-072X-7-3 Text en Copyright © 2008 Henry and Boscoe; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology
Henry, Kevin A
Boscoe, Francis P
Estimating the accuracy of geographical imputation
title Estimating the accuracy of geographical imputation
title_full Estimating the accuracy of geographical imputation
title_fullStr Estimating the accuracy of geographical imputation
title_full_unstemmed Estimating the accuracy of geographical imputation
title_short Estimating the accuracy of geographical imputation
title_sort estimating the accuracy of geographical imputation
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2266732/
https://www.ncbi.nlm.nih.gov/pubmed/18215308
http://dx.doi.org/10.1186/1476-072X-7-3
work_keys_str_mv AT henrykevina estimatingtheaccuracyofgeographicalimputation
AT boscoefrancisp estimatingtheaccuracyofgeographicalimputation