Cargando…
Building the National Database of Health Centred on the Individual: Administrative and Epidemiological Record Linkage - Brazil, 2000-2015
INTRODUCTION: In Brazil, the National Health System (SUS) provides healthcare to the public. The system has multiple administrative databases; the major databases record hospital (SIH) and outpatient (SIA) procedures. Epidemiological information is collected for all populations in subsystems, such a...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Swansea University
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8142958/ https://www.ncbi.nlm.nih.gov/pubmed/34095519 http://dx.doi.org/10.23889/ijpds.v3i1.446 |
_version_ | 1783696655194783744 |
---|---|
author | Guerra Junior, Augusto Afonso Pereira, Ramon Gonçalves Gurgel, Eli Iola Cherchiglia, Mariangela Dias, Leonardo Vinicius Ávila, Juliano D Santos, Núbia Reis, Afonso Acurcio, Francisco Assis Meira Junior, Wagner |
author_facet | Guerra Junior, Augusto Afonso Pereira, Ramon Gonçalves Gurgel, Eli Iola Cherchiglia, Mariangela Dias, Leonardo Vinicius Ávila, Juliano D Santos, Núbia Reis, Afonso Acurcio, Francisco Assis Meira Junior, Wagner |
author_sort | Guerra Junior, Augusto Afonso |
collection | PubMed |
description | INTRODUCTION: In Brazil, the National Health System (SUS) provides healthcare to the public. The system has multiple administrative databases; the major databases record hospital (SIH) and outpatient (SIA) procedures. Epidemiological information is collected for all populations in subsystems, such as mortality (SIM), live births (SINASC) and diseases of compulsory declaration (SINAN). Each subsystem has its own information system, which is able to provide information about consultations, clinical information and medicines dispensed. However, these systems are not linked, thereby preventing individual-centred analysis. OBJECTIVE: To describe the methods and results of parameter setting that are needed to execute the probabilistic deduplication of large administrative and epidemiological databases in Brazil and to create a National Health Database Centred on the individual. METHODS: This paper shows the results of a record linkage model to integrate data from SIH, SIA, SIM, and SINAN, which have different formats and attributes between them and over time. These data consist of 1.3 billion records from 2000-2015. Probabilistic and deterministic record linkages were used to deduplicate these data. The Kappa statistic and clerical review were used to ensure the quality of the linkage. The graph algorithm and depth-first search were used to generate the identifiers. RESULTS: The deterministic deduplication process resulted in a database with 403,113,527 possible unique individuals. After the probabilistic deduplication process of the former database was performed, 159,703,805 unique individuals were identified. This result had an estimated a false positive error rate of 3.3%, and the false negative error was estimated at 12.3%. CONCLUSIONS: The National Health Database centred on the individual was generated and will allow researchers to use real-world evidence to conduct clinical, epidemiological, economic and other studies. This database represents a significant cohort, spanning 15 years of historical data and preserving patient privacy. The success of the process described will allow repeating and appending the data for future years and enable important studies to promote SUS efficiency and provide better treatments for patients. |
format | Online Article Text |
id | pubmed-8142958 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Swansea University |
record_format | MEDLINE/PubMed |
spelling | pubmed-81429582021-06-04 Building the National Database of Health Centred on the Individual: Administrative and Epidemiological Record Linkage - Brazil, 2000-2015 Guerra Junior, Augusto Afonso Pereira, Ramon Gonçalves Gurgel, Eli Iola Cherchiglia, Mariangela Dias, Leonardo Vinicius Ávila, Juliano D Santos, Núbia Reis, Afonso Acurcio, Francisco Assis Meira Junior, Wagner Int J Popul Data Sci Population Data Science INTRODUCTION: In Brazil, the National Health System (SUS) provides healthcare to the public. The system has multiple administrative databases; the major databases record hospital (SIH) and outpatient (SIA) procedures. Epidemiological information is collected for all populations in subsystems, such as mortality (SIM), live births (SINASC) and diseases of compulsory declaration (SINAN). Each subsystem has its own information system, which is able to provide information about consultations, clinical information and medicines dispensed. However, these systems are not linked, thereby preventing individual-centred analysis. OBJECTIVE: To describe the methods and results of parameter setting that are needed to execute the probabilistic deduplication of large administrative and epidemiological databases in Brazil and to create a National Health Database Centred on the individual. METHODS: This paper shows the results of a record linkage model to integrate data from SIH, SIA, SIM, and SINAN, which have different formats and attributes between them and over time. These data consist of 1.3 billion records from 2000-2015. Probabilistic and deterministic record linkages were used to deduplicate these data. The Kappa statistic and clerical review were used to ensure the quality of the linkage. The graph algorithm and depth-first search were used to generate the identifiers. RESULTS: The deterministic deduplication process resulted in a database with 403,113,527 possible unique individuals. After the probabilistic deduplication process of the former database was performed, 159,703,805 unique individuals were identified. This result had an estimated a false positive error rate of 3.3%, and the false negative error was estimated at 12.3%. CONCLUSIONS: The National Health Database centred on the individual was generated and will allow researchers to use real-world evidence to conduct clinical, epidemiological, economic and other studies. This database represents a significant cohort, spanning 15 years of historical data and preserving patient privacy. The success of the process described will allow repeating and appending the data for future years and enable important studies to promote SUS efficiency and provide better treatments for patients. Swansea University 2018-11-14 /pmc/articles/PMC8142958/ /pubmed/34095519 http://dx.doi.org/10.23889/ijpds.v3i1.446 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License. |
spellingShingle | Population Data Science Guerra Junior, Augusto Afonso Pereira, Ramon Gonçalves Gurgel, Eli Iola Cherchiglia, Mariangela Dias, Leonardo Vinicius Ávila, Juliano D Santos, Núbia Reis, Afonso Acurcio, Francisco Assis Meira Junior, Wagner Building the National Database of Health Centred on the Individual: Administrative and Epidemiological Record Linkage - Brazil, 2000-2015 |
title | Building the National Database of Health Centred on the Individual: Administrative and Epidemiological Record Linkage - Brazil, 2000-2015 |
title_full | Building the National Database of Health Centred on the Individual: Administrative and Epidemiological Record Linkage - Brazil, 2000-2015 |
title_fullStr | Building the National Database of Health Centred on the Individual: Administrative and Epidemiological Record Linkage - Brazil, 2000-2015 |
title_full_unstemmed | Building the National Database of Health Centred on the Individual: Administrative and Epidemiological Record Linkage - Brazil, 2000-2015 |
title_short | Building the National Database of Health Centred on the Individual: Administrative and Epidemiological Record Linkage - Brazil, 2000-2015 |
title_sort | building the national database of health centred on the individual: administrative and epidemiological record linkage - brazil, 2000-2015 |
topic | Population Data Science |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8142958/ https://www.ncbi.nlm.nih.gov/pubmed/34095519 http://dx.doi.org/10.23889/ijpds.v3i1.446 |
work_keys_str_mv | AT guerrajunioraugustoafonso buildingthenationaldatabaseofhealthcentredontheindividualadministrativeandepidemiologicalrecordlinkagebrazil20002015 AT pereiraramongoncalves buildingthenationaldatabaseofhealthcentredontheindividualadministrativeandepidemiologicalrecordlinkagebrazil20002015 AT gurgeleliiola buildingthenationaldatabaseofhealthcentredontheindividualadministrativeandepidemiologicalrecordlinkagebrazil20002015 AT cherchigliamariangela buildingthenationaldatabaseofhealthcentredontheindividualadministrativeandepidemiologicalrecordlinkagebrazil20002015 AT diasleonardovinicius buildingthenationaldatabaseofhealthcentredontheindividualadministrativeandepidemiologicalrecordlinkagebrazil20002015 AT avilajulianod buildingthenationaldatabaseofhealthcentredontheindividualadministrativeandepidemiologicalrecordlinkagebrazil20002015 AT santosnubia buildingthenationaldatabaseofhealthcentredontheindividualadministrativeandepidemiologicalrecordlinkagebrazil20002015 AT reisafonso buildingthenationaldatabaseofhealthcentredontheindividualadministrativeandepidemiologicalrecordlinkagebrazil20002015 AT acurciofranciscoassis buildingthenationaldatabaseofhealthcentredontheindividualadministrativeandepidemiologicalrecordlinkagebrazil20002015 AT meirajuniorwagner buildingthenationaldatabaseofhealthcentredontheindividualadministrativeandepidemiologicalrecordlinkagebrazil20002015 |