Cargando…

Assessing record linkage between health care and Vital Statistics databases using deterministic methods

BACKGROUND: We assessed the linkage and correct linkage rate using deterministic record linkage among three commonly used Canadian databases, namely, the population registry, hospital discharge data and Vital Statistics registry. METHODS: Three combinations of four personal identifiers (surname, fir...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Bing, Quan, Hude, Fong, Andrew, Lu, Mingshan
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1534029/
https://www.ncbi.nlm.nih.gov/pubmed/16597337
http://dx.doi.org/10.1186/1472-6963-6-48
_version_ 1782129090552135680
author Li, Bing
Quan, Hude
Fong, Andrew
Lu, Mingshan
author_facet Li, Bing
Quan, Hude
Fong, Andrew
Lu, Mingshan
author_sort Li, Bing
collection PubMed
description BACKGROUND: We assessed the linkage and correct linkage rate using deterministic record linkage among three commonly used Canadian databases, namely, the population registry, hospital discharge data and Vital Statistics registry. METHODS: Three combinations of four personal identifiers (surname, first name, sex and date of birth) were used to determine the optimal combination. The correct linkage rate was assessed using a unique personal health number available in all three databases. RESULTS: Among the three combinations, the combination of surname, sex, and date of birth had the highest linkage rate of 88.0% and 93.1%, and the second highest correct linkage rate of 96.9% and 98.9% between the population registry and Vital Statistics registry, and between the hospital discharge data and Vital Statistics registry in 2001, respectively. Adding the first name to the combination of the three identifiers above increased correct linkage by less than 1%, but at the cost of lowering the linkage rate almost by 10%. CONCLUSION: Our findings suggest that the combination of surname, sex and date of birth appears to be optimal using deterministic linkage. The linkage and correct linkage rates appear to vary by age and the type of database, but not by sex.
format Text
id pubmed-1534029
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-15340292006-08-09 Assessing record linkage between health care and Vital Statistics databases using deterministic methods Li, Bing Quan, Hude Fong, Andrew Lu, Mingshan BMC Health Serv Res Research Article BACKGROUND: We assessed the linkage and correct linkage rate using deterministic record linkage among three commonly used Canadian databases, namely, the population registry, hospital discharge data and Vital Statistics registry. METHODS: Three combinations of four personal identifiers (surname, first name, sex and date of birth) were used to determine the optimal combination. The correct linkage rate was assessed using a unique personal health number available in all three databases. RESULTS: Among the three combinations, the combination of surname, sex, and date of birth had the highest linkage rate of 88.0% and 93.1%, and the second highest correct linkage rate of 96.9% and 98.9% between the population registry and Vital Statistics registry, and between the hospital discharge data and Vital Statistics registry in 2001, respectively. Adding the first name to the combination of the three identifiers above increased correct linkage by less than 1%, but at the cost of lowering the linkage rate almost by 10%. CONCLUSION: Our findings suggest that the combination of surname, sex and date of birth appears to be optimal using deterministic linkage. The linkage and correct linkage rates appear to vary by age and the type of database, but not by sex. BioMed Central 2006-04-05 /pmc/articles/PMC1534029/ /pubmed/16597337 http://dx.doi.org/10.1186/1472-6963-6-48 Text en Copyright © 2006 Li et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Li, Bing
Quan, Hude
Fong, Andrew
Lu, Mingshan
Assessing record linkage between health care and Vital Statistics databases using deterministic methods
title Assessing record linkage between health care and Vital Statistics databases using deterministic methods
title_full Assessing record linkage between health care and Vital Statistics databases using deterministic methods
title_fullStr Assessing record linkage between health care and Vital Statistics databases using deterministic methods
title_full_unstemmed Assessing record linkage between health care and Vital Statistics databases using deterministic methods
title_short Assessing record linkage between health care and Vital Statistics databases using deterministic methods
title_sort assessing record linkage between health care and vital statistics databases using deterministic methods
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1534029/
https://www.ncbi.nlm.nih.gov/pubmed/16597337
http://dx.doi.org/10.1186/1472-6963-6-48
work_keys_str_mv AT libing assessingrecordlinkagebetweenhealthcareandvitalstatisticsdatabasesusingdeterministicmethods
AT quanhude assessingrecordlinkagebetweenhealthcareandvitalstatisticsdatabasesusingdeterministicmethods
AT fongandrew assessingrecordlinkagebetweenhealthcareandvitalstatisticsdatabasesusingdeterministicmethods
AT lumingshan assessingrecordlinkagebetweenhealthcareandvitalstatisticsdatabasesusingdeterministicmethods