Cargando…
The utility of Zip4 codes in spatial epidemiological analysis
There are many public health situations within the United States that require fine geographical scale data to effectively inform response and intervention strategies. However, a condition for accessing and analyzing such data, especially when multiple institutions are involved, is being able to pres...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10231782/ https://www.ncbi.nlm.nih.gov/pubmed/37256874 http://dx.doi.org/10.1371/journal.pone.0285552 |
_version_ | 1785051810068692992 |
---|---|
author | Ajayakumar, Jayakrishnan Curtis, Andrew Curtis, Jacqueline |
author_facet | Ajayakumar, Jayakrishnan Curtis, Andrew Curtis, Jacqueline |
author_sort | Ajayakumar, Jayakrishnan |
collection | PubMed |
description | There are many public health situations within the United States that require fine geographical scale data to effectively inform response and intervention strategies. However, a condition for accessing and analyzing such data, especially when multiple institutions are involved, is being able to preserve a degree of spatial privacy and confidentiality. Hospitals and state health departments, who are generally the custodians of these fine-scale health data, are sometimes understandably hesitant to collaborate with each other due to these concerns. This paper looks at the utility and pitfalls of using Zip4 codes, a data layer often included as it is believed to be “safe”, as a source for sharing fine-scale spatial health data that enables privacy preservation while maintaining a suitable precision for spatial analysis. While the Zip4 is widely supplied, researchers seldom utilize it. Nor is its spatial characteristics known by data guardians. To address this gap, we use the context of a near-real time spatial response to an emerging health threat to show how the Zip4 aggregation preserves an underlying spatial structure making it potentially suitable dataset for analysis. Our results suggest that based on the density of urbanization, Zip4 centroids are within 150 meters of the real location almost 99% of the time. Spatial analysis experiments performed on these Zip4 data suggest a far more insightful geographic output than if using more commonly used aggregation units such as street lines and census block groups. However, this improvement in analytical output comes at a spatial privy cost as Zip4 centroids have a higher potential of compromising spatial anonymity with 73% of addresses having a spatial k anonymity value less than 5 when compared to other aggregations. We conclude that while offers an exciting opportunity to share data between organizations, researchers and analysts need to be made aware of the potential for serious confidentiality violations. |
format | Online Article Text |
id | pubmed-10231782 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-102317822023-06-01 The utility of Zip4 codes in spatial epidemiological analysis Ajayakumar, Jayakrishnan Curtis, Andrew Curtis, Jacqueline PLoS One Research Article There are many public health situations within the United States that require fine geographical scale data to effectively inform response and intervention strategies. However, a condition for accessing and analyzing such data, especially when multiple institutions are involved, is being able to preserve a degree of spatial privacy and confidentiality. Hospitals and state health departments, who are generally the custodians of these fine-scale health data, are sometimes understandably hesitant to collaborate with each other due to these concerns. This paper looks at the utility and pitfalls of using Zip4 codes, a data layer often included as it is believed to be “safe”, as a source for sharing fine-scale spatial health data that enables privacy preservation while maintaining a suitable precision for spatial analysis. While the Zip4 is widely supplied, researchers seldom utilize it. Nor is its spatial characteristics known by data guardians. To address this gap, we use the context of a near-real time spatial response to an emerging health threat to show how the Zip4 aggregation preserves an underlying spatial structure making it potentially suitable dataset for analysis. Our results suggest that based on the density of urbanization, Zip4 centroids are within 150 meters of the real location almost 99% of the time. Spatial analysis experiments performed on these Zip4 data suggest a far more insightful geographic output than if using more commonly used aggregation units such as street lines and census block groups. However, this improvement in analytical output comes at a spatial privy cost as Zip4 centroids have a higher potential of compromising spatial anonymity with 73% of addresses having a spatial k anonymity value less than 5 when compared to other aggregations. We conclude that while offers an exciting opportunity to share data between organizations, researchers and analysts need to be made aware of the potential for serious confidentiality violations. Public Library of Science 2023-05-31 /pmc/articles/PMC10231782/ /pubmed/37256874 http://dx.doi.org/10.1371/journal.pone.0285552 Text en © 2023 Ajayakumar et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Ajayakumar, Jayakrishnan Curtis, Andrew Curtis, Jacqueline The utility of Zip4 codes in spatial epidemiological analysis |
title | The utility of Zip4 codes in spatial epidemiological analysis |
title_full | The utility of Zip4 codes in spatial epidemiological analysis |
title_fullStr | The utility of Zip4 codes in spatial epidemiological analysis |
title_full_unstemmed | The utility of Zip4 codes in spatial epidemiological analysis |
title_short | The utility of Zip4 codes in spatial epidemiological analysis |
title_sort | utility of zip4 codes in spatial epidemiological analysis |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10231782/ https://www.ncbi.nlm.nih.gov/pubmed/37256874 http://dx.doi.org/10.1371/journal.pone.0285552 |
work_keys_str_mv | AT ajayakumarjayakrishnan theutilityofzip4codesinspatialepidemiologicalanalysis AT curtisandrew theutilityofzip4codesinspatialepidemiologicalanalysis AT curtisjacqueline theutilityofzip4codesinspatialepidemiologicalanalysis AT ajayakumarjayakrishnan utilityofzip4codesinspatialepidemiologicalanalysis AT curtisandrew utilityofzip4codesinspatialepidemiologicalanalysis AT curtisjacqueline utilityofzip4codesinspatialepidemiologicalanalysis |