Cargando…

Building the National Database of Health Centred on the Individual: Administrative and Epidemiological Record Linkage - Brazil, 2000-2015

INTRODUCTION: In Brazil, the National Health System (SUS) provides healthcare to the public. The system has multiple administrative databases; the major databases record hospital (SIH) and outpatient (SIA) procedures. Epidemiological information is collected for all populations in subsystems, such a...

Descripción completa

Detalles Bibliográficos
Autores principales: Guerra Junior, Augusto Afonso, Pereira, Ramon Gonçalves, Gurgel, Eli Iola, Cherchiglia, Mariangela, Dias, Leonardo Vinicius, Ávila, Juliano D, Santos, Núbia, Reis, Afonso, Acurcio, Francisco Assis, Meira Junior, Wagner
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Swansea University 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8142958/
https://www.ncbi.nlm.nih.gov/pubmed/34095519
http://dx.doi.org/10.23889/ijpds.v3i1.446
_version_ 1783696655194783744
author Guerra Junior, Augusto Afonso
Pereira, Ramon Gonçalves
Gurgel, Eli Iola
Cherchiglia, Mariangela
Dias, Leonardo Vinicius
Ávila, Juliano D
Santos, Núbia
Reis, Afonso
Acurcio, Francisco Assis
Meira Junior, Wagner
author_facet Guerra Junior, Augusto Afonso
Pereira, Ramon Gonçalves
Gurgel, Eli Iola
Cherchiglia, Mariangela
Dias, Leonardo Vinicius
Ávila, Juliano D
Santos, Núbia
Reis, Afonso
Acurcio, Francisco Assis
Meira Junior, Wagner
author_sort Guerra Junior, Augusto Afonso
collection PubMed
description INTRODUCTION: In Brazil, the National Health System (SUS) provides healthcare to the public. The system has multiple administrative databases; the major databases record hospital (SIH) and outpatient (SIA) procedures. Epidemiological information is collected for all populations in subsystems, such as mortality (SIM), live births (SINASC) and diseases of compulsory declaration (SINAN). Each subsystem has its own information system, which is able to provide information about consultations, clinical information and medicines dispensed. However, these systems are not linked, thereby preventing individual-centred analysis. OBJECTIVE: To describe the methods and results of parameter setting that are needed to execute the probabilistic deduplication of large administrative and epidemiological databases in Brazil and to create a National Health Database Centred on the individual. METHODS: This paper shows the results of a record linkage model to integrate data from SIH, SIA, SIM, and SINAN, which have different formats and attributes between them and over time. These data consist of 1.3 billion records from 2000-2015. Probabilistic and deterministic record linkages were used to deduplicate these data. The Kappa statistic and clerical review were used to ensure the quality of the linkage. The graph algorithm and depth-first search were used to generate the identifiers. RESULTS: The deterministic deduplication process resulted in a database with 403,113,527 possible unique individuals. After the probabilistic deduplication process of the former database was performed, 159,703,805 unique individuals were identified. This result had an estimated a false positive error rate of 3.3%, and the false negative error was estimated at 12.3%. CONCLUSIONS: The National Health Database centred on the individual was generated and will allow researchers to use real-world evidence to conduct clinical, epidemiological, economic and other studies. This database represents a significant cohort, spanning 15 years of historical data and preserving patient privacy. The success of the process described will allow repeating and appending the data for future years and enable important studies to promote SUS efficiency and provide better treatments for patients.
format Online
Article
Text
id pubmed-8142958
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Swansea University
record_format MEDLINE/PubMed
spelling pubmed-81429582021-06-04 Building the National Database of Health Centred on the Individual: Administrative and Epidemiological Record Linkage - Brazil, 2000-2015 Guerra Junior, Augusto Afonso Pereira, Ramon Gonçalves Gurgel, Eli Iola Cherchiglia, Mariangela Dias, Leonardo Vinicius Ávila, Juliano D Santos, Núbia Reis, Afonso Acurcio, Francisco Assis Meira Junior, Wagner Int J Popul Data Sci Population Data Science INTRODUCTION: In Brazil, the National Health System (SUS) provides healthcare to the public. The system has multiple administrative databases; the major databases record hospital (SIH) and outpatient (SIA) procedures. Epidemiological information is collected for all populations in subsystems, such as mortality (SIM), live births (SINASC) and diseases of compulsory declaration (SINAN). Each subsystem has its own information system, which is able to provide information about consultations, clinical information and medicines dispensed. However, these systems are not linked, thereby preventing individual-centred analysis. OBJECTIVE: To describe the methods and results of parameter setting that are needed to execute the probabilistic deduplication of large administrative and epidemiological databases in Brazil and to create a National Health Database Centred on the individual. METHODS: This paper shows the results of a record linkage model to integrate data from SIH, SIA, SIM, and SINAN, which have different formats and attributes between them and over time. These data consist of 1.3 billion records from 2000-2015. Probabilistic and deterministic record linkages were used to deduplicate these data. The Kappa statistic and clerical review were used to ensure the quality of the linkage. The graph algorithm and depth-first search were used to generate the identifiers. RESULTS: The deterministic deduplication process resulted in a database with 403,113,527 possible unique individuals. After the probabilistic deduplication process of the former database was performed, 159,703,805 unique individuals were identified. This result had an estimated a false positive error rate of 3.3%, and the false negative error was estimated at 12.3%. CONCLUSIONS: The National Health Database centred on the individual was generated and will allow researchers to use real-world evidence to conduct clinical, epidemiological, economic and other studies. This database represents a significant cohort, spanning 15 years of historical data and preserving patient privacy. The success of the process described will allow repeating and appending the data for future years and enable important studies to promote SUS efficiency and provide better treatments for patients. Swansea University 2018-11-14 /pmc/articles/PMC8142958/ /pubmed/34095519 http://dx.doi.org/10.23889/ijpds.v3i1.446 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License.
spellingShingle Population Data Science
Guerra Junior, Augusto Afonso
Pereira, Ramon Gonçalves
Gurgel, Eli Iola
Cherchiglia, Mariangela
Dias, Leonardo Vinicius
Ávila, Juliano D
Santos, Núbia
Reis, Afonso
Acurcio, Francisco Assis
Meira Junior, Wagner
Building the National Database of Health Centred on the Individual: Administrative and Epidemiological Record Linkage - Brazil, 2000-2015
title Building the National Database of Health Centred on the Individual: Administrative and Epidemiological Record Linkage - Brazil, 2000-2015
title_full Building the National Database of Health Centred on the Individual: Administrative and Epidemiological Record Linkage - Brazil, 2000-2015
title_fullStr Building the National Database of Health Centred on the Individual: Administrative and Epidemiological Record Linkage - Brazil, 2000-2015
title_full_unstemmed Building the National Database of Health Centred on the Individual: Administrative and Epidemiological Record Linkage - Brazil, 2000-2015
title_short Building the National Database of Health Centred on the Individual: Administrative and Epidemiological Record Linkage - Brazil, 2000-2015
title_sort building the national database of health centred on the individual: administrative and epidemiological record linkage - brazil, 2000-2015
topic Population Data Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8142958/
https://www.ncbi.nlm.nih.gov/pubmed/34095519
http://dx.doi.org/10.23889/ijpds.v3i1.446
work_keys_str_mv AT guerrajunioraugustoafonso buildingthenationaldatabaseofhealthcentredontheindividualadministrativeandepidemiologicalrecordlinkagebrazil20002015
AT pereiraramongoncalves buildingthenationaldatabaseofhealthcentredontheindividualadministrativeandepidemiologicalrecordlinkagebrazil20002015
AT gurgeleliiola buildingthenationaldatabaseofhealthcentredontheindividualadministrativeandepidemiologicalrecordlinkagebrazil20002015
AT cherchigliamariangela buildingthenationaldatabaseofhealthcentredontheindividualadministrativeandepidemiologicalrecordlinkagebrazil20002015
AT diasleonardovinicius buildingthenationaldatabaseofhealthcentredontheindividualadministrativeandepidemiologicalrecordlinkagebrazil20002015
AT avilajulianod buildingthenationaldatabaseofhealthcentredontheindividualadministrativeandepidemiologicalrecordlinkagebrazil20002015
AT santosnubia buildingthenationaldatabaseofhealthcentredontheindividualadministrativeandepidemiologicalrecordlinkagebrazil20002015
AT reisafonso buildingthenationaldatabaseofhealthcentredontheindividualadministrativeandepidemiologicalrecordlinkagebrazil20002015
AT acurciofranciscoassis buildingthenationaldatabaseofhealthcentredontheindividualadministrativeandepidemiologicalrecordlinkagebrazil20002015
AT meirajuniorwagner buildingthenationaldatabaseofhealthcentredontheindividualadministrativeandepidemiologicalrecordlinkagebrazil20002015