Cargando…

Examining the quality of record linkage process using nationwide Brazilian administrative databases to build a large birth cohort

BACKGROUND: Research using linked routine population-based data collected for non-research purposes has increased in recent years because they are a rich and detailed source of information. The objective of this study is to present an approach to prepare and link data from administrative sources in...

Descripción completa

Detalles Bibliográficos
Autores principales: Almeida, Daniela, Gorender, David, Ichihara, Maria Yury, Sena, Samila, Menezes, Luan, Barbosa, George C. G., Fiaccone, Rosimeire L., Paixão, Enny S., Pita, Robespierre, Barreto, Mauricio L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7382864/
https://www.ncbi.nlm.nih.gov/pubmed/32711532
http://dx.doi.org/10.1186/s12911-020-01192-0
_version_ 1783563335173668864
author Almeida, Daniela
Gorender, David
Ichihara, Maria Yury
Sena, Samila
Menezes, Luan
Barbosa, George C. G.
Fiaccone, Rosimeire L.
Paixão, Enny S.
Pita, Robespierre
Barreto, Mauricio L.
author_facet Almeida, Daniela
Gorender, David
Ichihara, Maria Yury
Sena, Samila
Menezes, Luan
Barbosa, George C. G.
Fiaccone, Rosimeire L.
Paixão, Enny S.
Pita, Robespierre
Barreto, Mauricio L.
author_sort Almeida, Daniela
collection PubMed
description BACKGROUND: Research using linked routine population-based data collected for non-research purposes has increased in recent years because they are a rich and detailed source of information. The objective of this study is to present an approach to prepare and link data from administrative sources in a middle-income country, to estimate its quality and to identify potential sources of bias by comparing linked and non-linked individuals. METHODS: We linked two administrative datasets with data covering the period 2001 to 2015, using maternal attributes (name, age, date of birth, and municipally of residence) from Brazil: live birth information system and the 100 Million Brazilian Cohort (created using administrative records from over 114 million individuals whose families applied for social assistance via the Unified Register for Social Programmes) implementing an in house developed linkage tool CIDACS-RL. We then estimated the proportion of highly probably link and examined the characteristics of missed-matches to identify any potential source of bias. RESULTS: A total of 27,699,891 live births were submited to linkage with maternal information recorded in the baseline of the 100 Million Brazilian Cohort dataset of those, 16,447,414 (59.4%) children were found registered in the 100 Million Brazilian Cohort dataset. The proportion of highly probably link ranged from 39.3% in 2001 to 82.1% in 2014. A substantial improvement in the linkage after the introduction of maternal date of birth attribute, in 2011, was observed. Our analyses indicated a slightly higher proportion of missing data among missed matches and a higher proportion of people living in an urban area and self-declared as Caucasian among linked pairs when compared with non-linked sets. DISCUSSION: We demonstrated that CIDACS-RL is capable of performing high quality linkage even with a limited number of common attributes, using indexation as a blocking strategy in larg e routine databases from a middle-income country. However, residual records occurred more among people under worse living conditions. The results presented in this study reinforce the need of evaluating linkage quality and when necessary to take linkage error into account for the analyses of any generated dataset.
format Online
Article
Text
id pubmed-7382864
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-73828642020-07-28 Examining the quality of record linkage process using nationwide Brazilian administrative databases to build a large birth cohort Almeida, Daniela Gorender, David Ichihara, Maria Yury Sena, Samila Menezes, Luan Barbosa, George C. G. Fiaccone, Rosimeire L. Paixão, Enny S. Pita, Robespierre Barreto, Mauricio L. BMC Med Inform Decis Mak Research Article BACKGROUND: Research using linked routine population-based data collected for non-research purposes has increased in recent years because they are a rich and detailed source of information. The objective of this study is to present an approach to prepare and link data from administrative sources in a middle-income country, to estimate its quality and to identify potential sources of bias by comparing linked and non-linked individuals. METHODS: We linked two administrative datasets with data covering the period 2001 to 2015, using maternal attributes (name, age, date of birth, and municipally of residence) from Brazil: live birth information system and the 100 Million Brazilian Cohort (created using administrative records from over 114 million individuals whose families applied for social assistance via the Unified Register for Social Programmes) implementing an in house developed linkage tool CIDACS-RL. We then estimated the proportion of highly probably link and examined the characteristics of missed-matches to identify any potential source of bias. RESULTS: A total of 27,699,891 live births were submited to linkage with maternal information recorded in the baseline of the 100 Million Brazilian Cohort dataset of those, 16,447,414 (59.4%) children were found registered in the 100 Million Brazilian Cohort dataset. The proportion of highly probably link ranged from 39.3% in 2001 to 82.1% in 2014. A substantial improvement in the linkage after the introduction of maternal date of birth attribute, in 2011, was observed. Our analyses indicated a slightly higher proportion of missing data among missed matches and a higher proportion of people living in an urban area and self-declared as Caucasian among linked pairs when compared with non-linked sets. DISCUSSION: We demonstrated that CIDACS-RL is capable of performing high quality linkage even with a limited number of common attributes, using indexation as a blocking strategy in larg e routine databases from a middle-income country. However, residual records occurred more among people under worse living conditions. The results presented in this study reinforce the need of evaluating linkage quality and when necessary to take linkage error into account for the analyses of any generated dataset. BioMed Central 2020-07-25 /pmc/articles/PMC7382864/ /pubmed/32711532 http://dx.doi.org/10.1186/s12911-020-01192-0 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Almeida, Daniela
Gorender, David
Ichihara, Maria Yury
Sena, Samila
Menezes, Luan
Barbosa, George C. G.
Fiaccone, Rosimeire L.
Paixão, Enny S.
Pita, Robespierre
Barreto, Mauricio L.
Examining the quality of record linkage process using nationwide Brazilian administrative databases to build a large birth cohort
title Examining the quality of record linkage process using nationwide Brazilian administrative databases to build a large birth cohort
title_full Examining the quality of record linkage process using nationwide Brazilian administrative databases to build a large birth cohort
title_fullStr Examining the quality of record linkage process using nationwide Brazilian administrative databases to build a large birth cohort
title_full_unstemmed Examining the quality of record linkage process using nationwide Brazilian administrative databases to build a large birth cohort
title_short Examining the quality of record linkage process using nationwide Brazilian administrative databases to build a large birth cohort
title_sort examining the quality of record linkage process using nationwide brazilian administrative databases to build a large birth cohort
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7382864/
https://www.ncbi.nlm.nih.gov/pubmed/32711532
http://dx.doi.org/10.1186/s12911-020-01192-0
work_keys_str_mv AT almeidadaniela examiningthequalityofrecordlinkageprocessusingnationwidebrazilianadministrativedatabasestobuildalargebirthcohort
AT gorenderdavid examiningthequalityofrecordlinkageprocessusingnationwidebrazilianadministrativedatabasestobuildalargebirthcohort
AT ichiharamariayury examiningthequalityofrecordlinkageprocessusingnationwidebrazilianadministrativedatabasestobuildalargebirthcohort
AT senasamila examiningthequalityofrecordlinkageprocessusingnationwidebrazilianadministrativedatabasestobuildalargebirthcohort
AT menezesluan examiningthequalityofrecordlinkageprocessusingnationwidebrazilianadministrativedatabasestobuildalargebirthcohort
AT barbosageorgecg examiningthequalityofrecordlinkageprocessusingnationwidebrazilianadministrativedatabasestobuildalargebirthcohort
AT fiacconerosimeirel examiningthequalityofrecordlinkageprocessusingnationwidebrazilianadministrativedatabasestobuildalargebirthcohort
AT paixaoennys examiningthequalityofrecordlinkageprocessusingnationwidebrazilianadministrativedatabasestobuildalargebirthcohort
AT pitarobespierre examiningthequalityofrecordlinkageprocessusingnationwidebrazilianadministrativedatabasestobuildalargebirthcohort
AT barretomauriciol examiningthequalityofrecordlinkageprocessusingnationwidebrazilianadministrativedatabasestobuildalargebirthcohort