Cargando…

Optimal strategy for linkage of datasets containing a statistical linkage key and datasets with full personal identifiers

BACKGROUND: Linkage of aged care and hospitalisation data provides valuable information on patterns of health service utilisation among aged care service recipients. Many aged care datasets in Australia contain a Statistical Linkage Key (SLK-581) instead of full personal identifiers. We linked hospi...

Descripción completa

Detalles Bibliográficos
Autores principales: Taylor, Lee K, Irvine, Katie, Iannotti, Renee, Harchak, Taylor, Lim, Kim
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4236530/
https://www.ncbi.nlm.nih.gov/pubmed/25257549
http://dx.doi.org/10.1186/1472-6947-14-85
_version_ 1782345183590875136
author Taylor, Lee K
Irvine, Katie
Iannotti, Renee
Harchak, Taylor
Lim, Kim
author_facet Taylor, Lee K
Irvine, Katie
Iannotti, Renee
Harchak, Taylor
Lim, Kim
author_sort Taylor, Lee K
collection PubMed
description BACKGROUND: Linkage of aged care and hospitalisation data provides valuable information on patterns of health service utilisation among aged care service recipients. Many aged care datasets in Australia contain a Statistical Linkage Key (SLK-581) instead of full personal identifiers. We linked hospital and death records using a full probabilistic strategy, the SLK-581, and three combined strategies; and compared results for each strategy. METHODS: Linkage of Admitted Patient Data for 2000–01 to 2008–09 and Registry of Births, Deaths and Marriages death registration data for 2008–09 for New South Wales, Australia, was carried out using probabilistic methods and compared to links created using four strategies incorporating a SLK-581. The Basic SLK-581 strategy used the SLK-581 alone. The Most Recent SLK-581, Most Frequent SLK-581, and Any Match SLK-581 strategies leveraged probabilistic links between hospital records drawn from the Centre for Health Record Linkage Master Linkage Key. Rates of hospitalisations among people who died were calculated for each strategy and a range of health conditions. RESULTS: Compared to full probabilistic linkage, the basic SLK-581 strategy produced substantial rates of missed links that increased over the study period and produced underestimates of hospitalisation rates that varied by health condition. The Most Recent SLK-581, Most Frequent SLK-581, and Any Match SLK-581 strategies resulted in substantially lower rates of underestimation than the Basic SLK-581. The Any Match SLK-581 strategy gave results closest to full probabilistic linkage. CONCLUSIONS: Hospitalisation rates prior to death are substantially underestimated by linkage using a SLK-581 alone. Linkage rates can be increased by combining deterministic methods with probabilistically created links across hospital records.
format Online
Article
Text
id pubmed-4236530
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42365302014-11-19 Optimal strategy for linkage of datasets containing a statistical linkage key and datasets with full personal identifiers Taylor, Lee K Irvine, Katie Iannotti, Renee Harchak, Taylor Lim, Kim BMC Med Inform Decis Mak Research Article BACKGROUND: Linkage of aged care and hospitalisation data provides valuable information on patterns of health service utilisation among aged care service recipients. Many aged care datasets in Australia contain a Statistical Linkage Key (SLK-581) instead of full personal identifiers. We linked hospital and death records using a full probabilistic strategy, the SLK-581, and three combined strategies; and compared results for each strategy. METHODS: Linkage of Admitted Patient Data for 2000–01 to 2008–09 and Registry of Births, Deaths and Marriages death registration data for 2008–09 for New South Wales, Australia, was carried out using probabilistic methods and compared to links created using four strategies incorporating a SLK-581. The Basic SLK-581 strategy used the SLK-581 alone. The Most Recent SLK-581, Most Frequent SLK-581, and Any Match SLK-581 strategies leveraged probabilistic links between hospital records drawn from the Centre for Health Record Linkage Master Linkage Key. Rates of hospitalisations among people who died were calculated for each strategy and a range of health conditions. RESULTS: Compared to full probabilistic linkage, the basic SLK-581 strategy produced substantial rates of missed links that increased over the study period and produced underestimates of hospitalisation rates that varied by health condition. The Most Recent SLK-581, Most Frequent SLK-581, and Any Match SLK-581 strategies resulted in substantially lower rates of underestimation than the Basic SLK-581. The Any Match SLK-581 strategy gave results closest to full probabilistic linkage. CONCLUSIONS: Hospitalisation rates prior to death are substantially underestimated by linkage using a SLK-581 alone. Linkage rates can be increased by combining deterministic methods with probabilistically created links across hospital records. BioMed Central 2014-09-25 /pmc/articles/PMC4236530/ /pubmed/25257549 http://dx.doi.org/10.1186/1472-6947-14-85 Text en Copyright © 2014 Taylor et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Taylor, Lee K
Irvine, Katie
Iannotti, Renee
Harchak, Taylor
Lim, Kim
Optimal strategy for linkage of datasets containing a statistical linkage key and datasets with full personal identifiers
title Optimal strategy for linkage of datasets containing a statistical linkage key and datasets with full personal identifiers
title_full Optimal strategy for linkage of datasets containing a statistical linkage key and datasets with full personal identifiers
title_fullStr Optimal strategy for linkage of datasets containing a statistical linkage key and datasets with full personal identifiers
title_full_unstemmed Optimal strategy for linkage of datasets containing a statistical linkage key and datasets with full personal identifiers
title_short Optimal strategy for linkage of datasets containing a statistical linkage key and datasets with full personal identifiers
title_sort optimal strategy for linkage of datasets containing a statistical linkage key and datasets with full personal identifiers
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4236530/
https://www.ncbi.nlm.nih.gov/pubmed/25257549
http://dx.doi.org/10.1186/1472-6947-14-85
work_keys_str_mv AT taylorleek optimalstrategyforlinkageofdatasetscontainingastatisticallinkagekeyanddatasetswithfullpersonalidentifiers
AT irvinekatie optimalstrategyforlinkageofdatasetscontainingastatisticallinkagekeyanddatasetswithfullpersonalidentifiers
AT iannottirenee optimalstrategyforlinkageofdatasetscontainingastatisticallinkagekeyanddatasetswithfullpersonalidentifiers
AT harchaktaylor optimalstrategyforlinkageofdatasetscontainingastatisticallinkagekeyanddatasetswithfullpersonalidentifiers
AT limkim optimalstrategyforlinkageofdatasetscontainingastatisticallinkagekeyanddatasetswithfullpersonalidentifiers