Cargando…

Integrating data from multiple Finnish biobanks and national health-care registers for retrospective studies: Practical experiences

Aim: This case study aimed to investigate the process of integrating resources of multiple biobanks and health-care registers, especially addressing data permit application, time schedules, co-operation of stakeholders, data exchange and data quality. Methods: We investigated the process in the cont...

Descripción completa

Detalles Bibliográficos
Autores principales: Lähteenmäki, Jaakko, Vuorinen, Anna-Leena, Pajula, Juha, Harno, Kari, Lehto, Mika, Niemi, Mikko, Van Gils, Mark
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9152591/
https://www.ncbi.nlm.nih.gov/pubmed/33845693
http://dx.doi.org/10.1177/14034948211004421
_version_ 1784717684588412928
author Lähteenmäki, Jaakko
Vuorinen, Anna-Leena
Pajula, Juha
Harno, Kari
Lehto, Mika
Niemi, Mikko
Van Gils, Mark
author_facet Lähteenmäki, Jaakko
Vuorinen, Anna-Leena
Pajula, Juha
Harno, Kari
Lehto, Mika
Niemi, Mikko
Van Gils, Mark
author_sort Lähteenmäki, Jaakko
collection PubMed
description Aim: This case study aimed to investigate the process of integrating resources of multiple biobanks and health-care registers, especially addressing data permit application, time schedules, co-operation of stakeholders, data exchange and data quality. Methods: We investigated the process in the context of a retrospective study: Pharmacogenomics of antithrombotic drugs (PreMed study). The study involved linking the genotype data of three Finnish biobanks (Auria Biobank, Helsinki Biobank and THL Biobank) with register data on medicine dispensations, health-care encounters and laboratory results. Results: We managed to collect a cohort of 7005 genotyped individuals, thereby achieving the statistical power requirements of the study. The data collection process took 16 months, exceeding our original estimate by seven months. The main delays were caused by the congested data permit approval service to access national register data on health-care encounters. Comparison of hospital data lakes and national registers revealed differences, especially concerning medication data. Genetic variant frequencies were in line with earlier data reported for the European population. The yearly number of international normalised ratio (INR) tests showed stable behaviour over time. Conclusions: A large cohort, consisting of versatile individual-level phenotype and genotype data, can be constructed by integrating data from several biobanks and health data registers in Finland. Co-operation with biobanks is straightforward. However, long time periods need to be reserved when biobank resources are linked with national register data. There is a need for efforts to define general, harmonised co-operation practices and data exchange methods for enabling efficient collection of data from multiple sources.
format Online
Article
Text
id pubmed-9152591
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-91525912022-06-01 Integrating data from multiple Finnish biobanks and national health-care registers for retrospective studies: Practical experiences Lähteenmäki, Jaakko Vuorinen, Anna-Leena Pajula, Juha Harno, Kari Lehto, Mika Niemi, Mikko Van Gils, Mark Scand J Public Health Original Articles Aim: This case study aimed to investigate the process of integrating resources of multiple biobanks and health-care registers, especially addressing data permit application, time schedules, co-operation of stakeholders, data exchange and data quality. Methods: We investigated the process in the context of a retrospective study: Pharmacogenomics of antithrombotic drugs (PreMed study). The study involved linking the genotype data of three Finnish biobanks (Auria Biobank, Helsinki Biobank and THL Biobank) with register data on medicine dispensations, health-care encounters and laboratory results. Results: We managed to collect a cohort of 7005 genotyped individuals, thereby achieving the statistical power requirements of the study. The data collection process took 16 months, exceeding our original estimate by seven months. The main delays were caused by the congested data permit approval service to access national register data on health-care encounters. Comparison of hospital data lakes and national registers revealed differences, especially concerning medication data. Genetic variant frequencies were in line with earlier data reported for the European population. The yearly number of international normalised ratio (INR) tests showed stable behaviour over time. Conclusions: A large cohort, consisting of versatile individual-level phenotype and genotype data, can be constructed by integrating data from several biobanks and health data registers in Finland. Co-operation with biobanks is straightforward. However, long time periods need to be reserved when biobank resources are linked with national register data. There is a need for efforts to define general, harmonised co-operation practices and data exchange methods for enabling efficient collection of data from multiple sources. SAGE Publications 2021-04-12 2022-06 /pmc/articles/PMC9152591/ /pubmed/33845693 http://dx.doi.org/10.1177/14034948211004421 Text en © Author(s) 2021 https://creativecommons.org/licenses/by-nc/4.0/This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Original Articles
Lähteenmäki, Jaakko
Vuorinen, Anna-Leena
Pajula, Juha
Harno, Kari
Lehto, Mika
Niemi, Mikko
Van Gils, Mark
Integrating data from multiple Finnish biobanks and national health-care registers for retrospective studies: Practical experiences
title Integrating data from multiple Finnish biobanks and national health-care registers for retrospective studies: Practical experiences
title_full Integrating data from multiple Finnish biobanks and national health-care registers for retrospective studies: Practical experiences
title_fullStr Integrating data from multiple Finnish biobanks and national health-care registers for retrospective studies: Practical experiences
title_full_unstemmed Integrating data from multiple Finnish biobanks and national health-care registers for retrospective studies: Practical experiences
title_short Integrating data from multiple Finnish biobanks and national health-care registers for retrospective studies: Practical experiences
title_sort integrating data from multiple finnish biobanks and national health-care registers for retrospective studies: practical experiences
topic Original Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9152591/
https://www.ncbi.nlm.nih.gov/pubmed/33845693
http://dx.doi.org/10.1177/14034948211004421
work_keys_str_mv AT lahteenmakijaakko integratingdatafrommultiplefinnishbiobanksandnationalhealthcareregistersforretrospectivestudiespracticalexperiences
AT vuorinenannaleena integratingdatafrommultiplefinnishbiobanksandnationalhealthcareregistersforretrospectivestudiespracticalexperiences
AT pajulajuha integratingdatafrommultiplefinnishbiobanksandnationalhealthcareregistersforretrospectivestudiespracticalexperiences
AT harnokari integratingdatafrommultiplefinnishbiobanksandnationalhealthcareregistersforretrospectivestudiespracticalexperiences
AT lehtomika integratingdatafrommultiplefinnishbiobanksandnationalhealthcareregistersforretrospectivestudiespracticalexperiences
AT niemimikko integratingdatafrommultiplefinnishbiobanksandnationalhealthcareregistersforretrospectivestudiespracticalexperiences
AT vangilsmark integratingdatafrommultiplefinnishbiobanksandnationalhealthcareregistersforretrospectivestudiespracticalexperiences