Cargando…
Optimal strategy for linkage of datasets containing a statistical linkage key and datasets with full personal identifiers
BACKGROUND: Linkage of aged care and hospitalisation data provides valuable information on patterns of health service utilisation among aged care service recipients. Many aged care datasets in Australia contain a Statistical Linkage Key (SLK-581) instead of full personal identifiers. We linked hospi...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4236530/ https://www.ncbi.nlm.nih.gov/pubmed/25257549 http://dx.doi.org/10.1186/1472-6947-14-85 |
_version_ | 1782345183590875136 |
---|---|
author | Taylor, Lee K Irvine, Katie Iannotti, Renee Harchak, Taylor Lim, Kim |
author_facet | Taylor, Lee K Irvine, Katie Iannotti, Renee Harchak, Taylor Lim, Kim |
author_sort | Taylor, Lee K |
collection | PubMed |
description | BACKGROUND: Linkage of aged care and hospitalisation data provides valuable information on patterns of health service utilisation among aged care service recipients. Many aged care datasets in Australia contain a Statistical Linkage Key (SLK-581) instead of full personal identifiers. We linked hospital and death records using a full probabilistic strategy, the SLK-581, and three combined strategies; and compared results for each strategy. METHODS: Linkage of Admitted Patient Data for 2000–01 to 2008–09 and Registry of Births, Deaths and Marriages death registration data for 2008–09 for New South Wales, Australia, was carried out using probabilistic methods and compared to links created using four strategies incorporating a SLK-581. The Basic SLK-581 strategy used the SLK-581 alone. The Most Recent SLK-581, Most Frequent SLK-581, and Any Match SLK-581 strategies leveraged probabilistic links between hospital records drawn from the Centre for Health Record Linkage Master Linkage Key. Rates of hospitalisations among people who died were calculated for each strategy and a range of health conditions. RESULTS: Compared to full probabilistic linkage, the basic SLK-581 strategy produced substantial rates of missed links that increased over the study period and produced underestimates of hospitalisation rates that varied by health condition. The Most Recent SLK-581, Most Frequent SLK-581, and Any Match SLK-581 strategies resulted in substantially lower rates of underestimation than the Basic SLK-581. The Any Match SLK-581 strategy gave results closest to full probabilistic linkage. CONCLUSIONS: Hospitalisation rates prior to death are substantially underestimated by linkage using a SLK-581 alone. Linkage rates can be increased by combining deterministic methods with probabilistically created links across hospital records. |
format | Online Article Text |
id | pubmed-4236530 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-42365302014-11-19 Optimal strategy for linkage of datasets containing a statistical linkage key and datasets with full personal identifiers Taylor, Lee K Irvine, Katie Iannotti, Renee Harchak, Taylor Lim, Kim BMC Med Inform Decis Mak Research Article BACKGROUND: Linkage of aged care and hospitalisation data provides valuable information on patterns of health service utilisation among aged care service recipients. Many aged care datasets in Australia contain a Statistical Linkage Key (SLK-581) instead of full personal identifiers. We linked hospital and death records using a full probabilistic strategy, the SLK-581, and three combined strategies; and compared results for each strategy. METHODS: Linkage of Admitted Patient Data for 2000–01 to 2008–09 and Registry of Births, Deaths and Marriages death registration data for 2008–09 for New South Wales, Australia, was carried out using probabilistic methods and compared to links created using four strategies incorporating a SLK-581. The Basic SLK-581 strategy used the SLK-581 alone. The Most Recent SLK-581, Most Frequent SLK-581, and Any Match SLK-581 strategies leveraged probabilistic links between hospital records drawn from the Centre for Health Record Linkage Master Linkage Key. Rates of hospitalisations among people who died were calculated for each strategy and a range of health conditions. RESULTS: Compared to full probabilistic linkage, the basic SLK-581 strategy produced substantial rates of missed links that increased over the study period and produced underestimates of hospitalisation rates that varied by health condition. The Most Recent SLK-581, Most Frequent SLK-581, and Any Match SLK-581 strategies resulted in substantially lower rates of underestimation than the Basic SLK-581. The Any Match SLK-581 strategy gave results closest to full probabilistic linkage. CONCLUSIONS: Hospitalisation rates prior to death are substantially underestimated by linkage using a SLK-581 alone. Linkage rates can be increased by combining deterministic methods with probabilistically created links across hospital records. BioMed Central 2014-09-25 /pmc/articles/PMC4236530/ /pubmed/25257549 http://dx.doi.org/10.1186/1472-6947-14-85 Text en Copyright © 2014 Taylor et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Taylor, Lee K Irvine, Katie Iannotti, Renee Harchak, Taylor Lim, Kim Optimal strategy for linkage of datasets containing a statistical linkage key and datasets with full personal identifiers |
title | Optimal strategy for linkage of datasets containing a statistical linkage key and datasets with full personal identifiers |
title_full | Optimal strategy for linkage of datasets containing a statistical linkage key and datasets with full personal identifiers |
title_fullStr | Optimal strategy for linkage of datasets containing a statistical linkage key and datasets with full personal identifiers |
title_full_unstemmed | Optimal strategy for linkage of datasets containing a statistical linkage key and datasets with full personal identifiers |
title_short | Optimal strategy for linkage of datasets containing a statistical linkage key and datasets with full personal identifiers |
title_sort | optimal strategy for linkage of datasets containing a statistical linkage key and datasets with full personal identifiers |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4236530/ https://www.ncbi.nlm.nih.gov/pubmed/25257549 http://dx.doi.org/10.1186/1472-6947-14-85 |
work_keys_str_mv | AT taylorleek optimalstrategyforlinkageofdatasetscontainingastatisticallinkagekeyanddatasetswithfullpersonalidentifiers AT irvinekatie optimalstrategyforlinkageofdatasetscontainingastatisticallinkagekeyanddatasetswithfullpersonalidentifiers AT iannottirenee optimalstrategyforlinkageofdatasetscontainingastatisticallinkagekeyanddatasetswithfullpersonalidentifiers AT harchaktaylor optimalstrategyforlinkageofdatasetscontainingastatisticallinkagekeyanddatasetswithfullpersonalidentifiers AT limkim optimalstrategyforlinkageofdatasetscontainingastatisticallinkagekeyanddatasetswithfullpersonalidentifiers |