Cargando…
Biases arising from linked administrative data for epidemiological research: a conceptual framework from registration to analyses
Linked administrative data offer a rich source of information that can be harnessed to describe patterns of disease, understand their causes and evaluate interventions. However, administrative data are primarily collected for operational reasons such as recording vital events for legal purposes, and...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Netherlands
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9792414/ https://www.ncbi.nlm.nih.gov/pubmed/36333542 http://dx.doi.org/10.1007/s10654-022-00934-w |
_version_ | 1784859630064631808 |
---|---|
author | Shaw, Richard J. Harron, Katie L. Pescarini, Julia M. Pinto Junior, Elzo Pereira Allik, Mirjam Siroky, Andressa N. Campbell, Desmond Dundas, Ruth Ichihara, Maria Yury Leyland, Alastair H. Barreto, Mauricio L. Katikireddi, Srinivasa Vittal |
author_facet | Shaw, Richard J. Harron, Katie L. Pescarini, Julia M. Pinto Junior, Elzo Pereira Allik, Mirjam Siroky, Andressa N. Campbell, Desmond Dundas, Ruth Ichihara, Maria Yury Leyland, Alastair H. Barreto, Mauricio L. Katikireddi, Srinivasa Vittal |
author_sort | Shaw, Richard J. |
collection | PubMed |
description | Linked administrative data offer a rich source of information that can be harnessed to describe patterns of disease, understand their causes and evaluate interventions. However, administrative data are primarily collected for operational reasons such as recording vital events for legal purposes, and planning, provision and monitoring of services. The processes involved in generating and linking administrative datasets may generate sources of bias that are often not adequately considered by researchers. We provide a framework describing these biases, drawing on our experiences of using the 100 Million Brazilian Cohort (100MCohort) which contains records of more than 131 million people whose families applied for social assistance between 2001 and 2018. Datasets for epidemiological research were derived by linking the 100MCohort to health-related databases such as the Mortality Information System and the Hospital Information System. Using the framework, we demonstrate how selection and misclassification biases may be introduced in three different stages: registering and recording of people’s life events and use of services, linkage across administrative databases, and cleaning and coding of variables from derived datasets. Finally, we suggest eight recommendations which may reduce biases when analysing data from administrative sources. |
format | Online Article Text |
id | pubmed-9792414 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Springer Netherlands |
record_format | MEDLINE/PubMed |
spelling | pubmed-97924142022-12-28 Biases arising from linked administrative data for epidemiological research: a conceptual framework from registration to analyses Shaw, Richard J. Harron, Katie L. Pescarini, Julia M. Pinto Junior, Elzo Pereira Allik, Mirjam Siroky, Andressa N. Campbell, Desmond Dundas, Ruth Ichihara, Maria Yury Leyland, Alastair H. Barreto, Mauricio L. Katikireddi, Srinivasa Vittal Eur J Epidemiol Methods Linked administrative data offer a rich source of information that can be harnessed to describe patterns of disease, understand their causes and evaluate interventions. However, administrative data are primarily collected for operational reasons such as recording vital events for legal purposes, and planning, provision and monitoring of services. The processes involved in generating and linking administrative datasets may generate sources of bias that are often not adequately considered by researchers. We provide a framework describing these biases, drawing on our experiences of using the 100 Million Brazilian Cohort (100MCohort) which contains records of more than 131 million people whose families applied for social assistance between 2001 and 2018. Datasets for epidemiological research were derived by linking the 100MCohort to health-related databases such as the Mortality Information System and the Hospital Information System. Using the framework, we demonstrate how selection and misclassification biases may be introduced in three different stages: registering and recording of people’s life events and use of services, linkage across administrative databases, and cleaning and coding of variables from derived datasets. Finally, we suggest eight recommendations which may reduce biases when analysing data from administrative sources. Springer Netherlands 2022-11-05 2022 /pmc/articles/PMC9792414/ /pubmed/36333542 http://dx.doi.org/10.1007/s10654-022-00934-w Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Methods Shaw, Richard J. Harron, Katie L. Pescarini, Julia M. Pinto Junior, Elzo Pereira Allik, Mirjam Siroky, Andressa N. Campbell, Desmond Dundas, Ruth Ichihara, Maria Yury Leyland, Alastair H. Barreto, Mauricio L. Katikireddi, Srinivasa Vittal Biases arising from linked administrative data for epidemiological research: a conceptual framework from registration to analyses |
title | Biases arising from linked administrative data for epidemiological research: a conceptual framework from registration to analyses |
title_full | Biases arising from linked administrative data for epidemiological research: a conceptual framework from registration to analyses |
title_fullStr | Biases arising from linked administrative data for epidemiological research: a conceptual framework from registration to analyses |
title_full_unstemmed | Biases arising from linked administrative data for epidemiological research: a conceptual framework from registration to analyses |
title_short | Biases arising from linked administrative data for epidemiological research: a conceptual framework from registration to analyses |
title_sort | biases arising from linked administrative data for epidemiological research: a conceptual framework from registration to analyses |
topic | Methods |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9792414/ https://www.ncbi.nlm.nih.gov/pubmed/36333542 http://dx.doi.org/10.1007/s10654-022-00934-w |
work_keys_str_mv | AT shawrichardj biasesarisingfromlinkedadministrativedataforepidemiologicalresearchaconceptualframeworkfromregistrationtoanalyses AT harronkatiel biasesarisingfromlinkedadministrativedataforepidemiologicalresearchaconceptualframeworkfromregistrationtoanalyses AT pescarinijuliam biasesarisingfromlinkedadministrativedataforepidemiologicalresearchaconceptualframeworkfromregistrationtoanalyses AT pintojuniorelzopereira biasesarisingfromlinkedadministrativedataforepidemiologicalresearchaconceptualframeworkfromregistrationtoanalyses AT allikmirjam biasesarisingfromlinkedadministrativedataforepidemiologicalresearchaconceptualframeworkfromregistrationtoanalyses AT sirokyandressan biasesarisingfromlinkedadministrativedataforepidemiologicalresearchaconceptualframeworkfromregistrationtoanalyses AT campbelldesmond biasesarisingfromlinkedadministrativedataforepidemiologicalresearchaconceptualframeworkfromregistrationtoanalyses AT dundasruth biasesarisingfromlinkedadministrativedataforepidemiologicalresearchaconceptualframeworkfromregistrationtoanalyses AT ichiharamariayury biasesarisingfromlinkedadministrativedataforepidemiologicalresearchaconceptualframeworkfromregistrationtoanalyses AT leylandalastairh biasesarisingfromlinkedadministrativedataforepidemiologicalresearchaconceptualframeworkfromregistrationtoanalyses AT barretomauriciol biasesarisingfromlinkedadministrativedataforepidemiologicalresearchaconceptualframeworkfromregistrationtoanalyses AT katikireddisrinivasavittal biasesarisingfromlinkedadministrativedataforepidemiologicalresearchaconceptualframeworkfromregistrationtoanalyses |