Cargando…

An unsupervised classification method for inferring original case locations from low-resolution disease maps

BACKGROUND: Widespread availability of geographic information systems software has facilitated the use of disease mapping in academia, government and private sector. Maps that display the address of affected patients are often exchanged in public forums, and published in peer-reviewed journal articl...

Descripción completa

Detalles Bibliográficos
Autores principales:	Brownstein, John S, Cassa, Christopher A, Kohane, Isaac S, Mandl, Kenneth D
Formato:	Texto
Lenguaje:	English
Publicado:	BioMed Central 2006
Materias:	Methodology
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1702538/ https://www.ncbi.nlm.nih.gov/pubmed/17156451 http://dx.doi.org/10.1186/1476-072X-5-56

_version_	1782131262410981376
author	Brownstein, John S Cassa, Christopher A Kohane, Isaac S Mandl, Kenneth D
author_facet	Brownstein, John S Cassa, Christopher A Kohane, Isaac S Mandl, Kenneth D
author_sort	Brownstein, John S
collection	PubMed
description	BACKGROUND: Widespread availability of geographic information systems software has facilitated the use of disease mapping in academia, government and private sector. Maps that display the address of affected patients are often exchanged in public forums, and published in peer-reviewed journal articles. As previously reported, a search of figure legends in five major medical journals found 19 articles from 1994–2004 that identify over 19,000 patient addresses. In this report, a method is presented to evaluate whether patient privacy is being breached in the publication of low-resolution disease maps. RESULTS: To demonstrate the effect, a hypothetical low-resolution map of geocoded patient addresses was created and the accuracy with which patient addresses can be resolved is described. Through georeferencing and unsupervised classification of the original image, the method precisely re-identified 26% (144/550) of the patient addresses from a presentation quality map and 79% (432/550) from a publication quality map. For the presentation quality map, 99.8% of the addresses were within 70 meters (approximately one city block length) of the predicted patient location, 51.6% of addresses were identified within five buildings, 70.7% within ten buildings and 93% within twenty buildings. For the publication quality map, all addresses were within 14 meters and 11 buildings of the predicted patient location. CONCLUSION: This study demonstrates that lowering the resolution of a map displaying geocoded patient addresses does not sufficiently protect patient addresses from re-identification. Guidelines to protect patient privacy, including those of medical journals, should reflect policies that ensure privacy protection when spatial data are displayed or published.
format	Text
id	pubmed-1702538
institution	National Center for Biotechnology Information
language	English
publishDate	2006
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-17025382006-12-16 An unsupervised classification method for inferring original case locations from low-resolution disease maps Brownstein, John S Cassa, Christopher A Kohane, Isaac S Mandl, Kenneth D Int J Health Geogr Methodology BACKGROUND: Widespread availability of geographic information systems software has facilitated the use of disease mapping in academia, government and private sector. Maps that display the address of affected patients are often exchanged in public forums, and published in peer-reviewed journal articles. As previously reported, a search of figure legends in five major medical journals found 19 articles from 1994–2004 that identify over 19,000 patient addresses. In this report, a method is presented to evaluate whether patient privacy is being breached in the publication of low-resolution disease maps. RESULTS: To demonstrate the effect, a hypothetical low-resolution map of geocoded patient addresses was created and the accuracy with which patient addresses can be resolved is described. Through georeferencing and unsupervised classification of the original image, the method precisely re-identified 26% (144/550) of the patient addresses from a presentation quality map and 79% (432/550) from a publication quality map. For the presentation quality map, 99.8% of the addresses were within 70 meters (approximately one city block length) of the predicted patient location, 51.6% of addresses were identified within five buildings, 70.7% within ten buildings and 93% within twenty buildings. For the publication quality map, all addresses were within 14 meters and 11 buildings of the predicted patient location. CONCLUSION: This study demonstrates that lowering the resolution of a map displaying geocoded patient addresses does not sufficiently protect patient addresses from re-identification. Guidelines to protect patient privacy, including those of medical journals, should reflect policies that ensure privacy protection when spatial data are displayed or published. BioMed Central 2006-12-08 /pmc/articles/PMC1702538/ /pubmed/17156451 http://dx.doi.org/10.1186/1476-072X-5-56 Text en Copyright © 2006 Brownstein et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Methodology Brownstein, John S Cassa, Christopher A Kohane, Isaac S Mandl, Kenneth D An unsupervised classification method for inferring original case locations from low-resolution disease maps
title	An unsupervised classification method for inferring original case locations from low-resolution disease maps
title_full	An unsupervised classification method for inferring original case locations from low-resolution disease maps
title_fullStr	An unsupervised classification method for inferring original case locations from low-resolution disease maps
title_full_unstemmed	An unsupervised classification method for inferring original case locations from low-resolution disease maps
title_short	An unsupervised classification method for inferring original case locations from low-resolution disease maps
title_sort	unsupervised classification method for inferring original case locations from low-resolution disease maps
topic	Methodology
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1702538/ https://www.ncbi.nlm.nih.gov/pubmed/17156451 http://dx.doi.org/10.1186/1476-072X-5-56
work_keys_str_mv	AT brownsteinjohns anunsupervisedclassificationmethodforinferringoriginalcaselocationsfromlowresolutiondiseasemaps AT cassachristophera anunsupervisedclassificationmethodforinferringoriginalcaselocationsfromlowresolutiondiseasemaps AT kohaneisaacs anunsupervisedclassificationmethodforinferringoriginalcaselocationsfromlowresolutiondiseasemaps AT mandlkennethd anunsupervisedclassificationmethodforinferringoriginalcaselocationsfromlowresolutiondiseasemaps AT brownsteinjohns unsupervisedclassificationmethodforinferringoriginalcaselocationsfromlowresolutiondiseasemaps AT cassachristophera unsupervisedclassificationmethodforinferringoriginalcaselocationsfromlowresolutiondiseasemaps AT kohaneisaacs unsupervisedclassificationmethodforinferringoriginalcaselocationsfromlowresolutiondiseasemaps AT mandlkennethd unsupervisedclassificationmethodforinferringoriginalcaselocationsfromlowresolutiondiseasemaps

An unsupervised classification method for inferring original case locations from low-resolution disease maps

Ejemplares similares