Cargando…

Biases arising from linked administrative data for epidemiological research: a conceptual framework from registration to analyses

Linked administrative data offer a rich source of information that can be harnessed to describe patterns of disease, understand their causes and evaluate interventions. However, administrative data are primarily collected for operational reasons such as recording vital events for legal purposes, and...

Descripción completa

Detalles Bibliográficos
Autores principales: Shaw, Richard J., Harron, Katie L., Pescarini, Julia M., Pinto Junior, Elzo Pereira, Allik, Mirjam, Siroky, Andressa N., Campbell, Desmond, Dundas, Ruth, Ichihara, Maria Yury, Leyland, Alastair H., Barreto, Mauricio L., Katikireddi, Srinivasa Vittal
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Netherlands 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9792414/
https://www.ncbi.nlm.nih.gov/pubmed/36333542
http://dx.doi.org/10.1007/s10654-022-00934-w
_version_ 1784859630064631808
author Shaw, Richard J.
Harron, Katie L.
Pescarini, Julia M.
Pinto Junior, Elzo Pereira
Allik, Mirjam
Siroky, Andressa N.
Campbell, Desmond
Dundas, Ruth
Ichihara, Maria Yury
Leyland, Alastair H.
Barreto, Mauricio L.
Katikireddi, Srinivasa Vittal
author_facet Shaw, Richard J.
Harron, Katie L.
Pescarini, Julia M.
Pinto Junior, Elzo Pereira
Allik, Mirjam
Siroky, Andressa N.
Campbell, Desmond
Dundas, Ruth
Ichihara, Maria Yury
Leyland, Alastair H.
Barreto, Mauricio L.
Katikireddi, Srinivasa Vittal
author_sort Shaw, Richard J.
collection PubMed
description Linked administrative data offer a rich source of information that can be harnessed to describe patterns of disease, understand their causes and evaluate interventions. However, administrative data are primarily collected for operational reasons such as recording vital events for legal purposes, and planning, provision and monitoring of services. The processes involved in generating and linking administrative datasets may generate sources of bias that are often not adequately considered by researchers. We provide a framework describing these biases, drawing on our experiences of using the 100 Million Brazilian Cohort (100MCohort) which contains records of more than 131 million people whose families applied for social assistance between 2001 and 2018. Datasets for epidemiological research were derived by linking the 100MCohort to health-related databases such as the Mortality Information System and the Hospital Information System. Using the framework, we demonstrate how selection and misclassification biases may be introduced in three different stages: registering and recording of people’s life events and use of services, linkage across administrative databases, and cleaning and coding of variables from derived datasets. Finally, we suggest eight recommendations which may reduce biases when analysing data from administrative sources.
format Online
Article
Text
id pubmed-9792414
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer Netherlands
record_format MEDLINE/PubMed
spelling pubmed-97924142022-12-28 Biases arising from linked administrative data for epidemiological research: a conceptual framework from registration to analyses Shaw, Richard J. Harron, Katie L. Pescarini, Julia M. Pinto Junior, Elzo Pereira Allik, Mirjam Siroky, Andressa N. Campbell, Desmond Dundas, Ruth Ichihara, Maria Yury Leyland, Alastair H. Barreto, Mauricio L. Katikireddi, Srinivasa Vittal Eur J Epidemiol Methods Linked administrative data offer a rich source of information that can be harnessed to describe patterns of disease, understand their causes and evaluate interventions. However, administrative data are primarily collected for operational reasons such as recording vital events for legal purposes, and planning, provision and monitoring of services. The processes involved in generating and linking administrative datasets may generate sources of bias that are often not adequately considered by researchers. We provide a framework describing these biases, drawing on our experiences of using the 100 Million Brazilian Cohort (100MCohort) which contains records of more than 131 million people whose families applied for social assistance between 2001 and 2018. Datasets for epidemiological research were derived by linking the 100MCohort to health-related databases such as the Mortality Information System and the Hospital Information System. Using the framework, we demonstrate how selection and misclassification biases may be introduced in three different stages: registering and recording of people’s life events and use of services, linkage across administrative databases, and cleaning and coding of variables from derived datasets. Finally, we suggest eight recommendations which may reduce biases when analysing data from administrative sources. Springer Netherlands 2022-11-05 2022 /pmc/articles/PMC9792414/ /pubmed/36333542 http://dx.doi.org/10.1007/s10654-022-00934-w Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Methods
Shaw, Richard J.
Harron, Katie L.
Pescarini, Julia M.
Pinto Junior, Elzo Pereira
Allik, Mirjam
Siroky, Andressa N.
Campbell, Desmond
Dundas, Ruth
Ichihara, Maria Yury
Leyland, Alastair H.
Barreto, Mauricio L.
Katikireddi, Srinivasa Vittal
Biases arising from linked administrative data for epidemiological research: a conceptual framework from registration to analyses
title Biases arising from linked administrative data for epidemiological research: a conceptual framework from registration to analyses
title_full Biases arising from linked administrative data for epidemiological research: a conceptual framework from registration to analyses
title_fullStr Biases arising from linked administrative data for epidemiological research: a conceptual framework from registration to analyses
title_full_unstemmed Biases arising from linked administrative data for epidemiological research: a conceptual framework from registration to analyses
title_short Biases arising from linked administrative data for epidemiological research: a conceptual framework from registration to analyses
title_sort biases arising from linked administrative data for epidemiological research: a conceptual framework from registration to analyses
topic Methods
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9792414/
https://www.ncbi.nlm.nih.gov/pubmed/36333542
http://dx.doi.org/10.1007/s10654-022-00934-w
work_keys_str_mv AT shawrichardj biasesarisingfromlinkedadministrativedataforepidemiologicalresearchaconceptualframeworkfromregistrationtoanalyses
AT harronkatiel biasesarisingfromlinkedadministrativedataforepidemiologicalresearchaconceptualframeworkfromregistrationtoanalyses
AT pescarinijuliam biasesarisingfromlinkedadministrativedataforepidemiologicalresearchaconceptualframeworkfromregistrationtoanalyses
AT pintojuniorelzopereira biasesarisingfromlinkedadministrativedataforepidemiologicalresearchaconceptualframeworkfromregistrationtoanalyses
AT allikmirjam biasesarisingfromlinkedadministrativedataforepidemiologicalresearchaconceptualframeworkfromregistrationtoanalyses
AT sirokyandressan biasesarisingfromlinkedadministrativedataforepidemiologicalresearchaconceptualframeworkfromregistrationtoanalyses
AT campbelldesmond biasesarisingfromlinkedadministrativedataforepidemiologicalresearchaconceptualframeworkfromregistrationtoanalyses
AT dundasruth biasesarisingfromlinkedadministrativedataforepidemiologicalresearchaconceptualframeworkfromregistrationtoanalyses
AT ichiharamariayury biasesarisingfromlinkedadministrativedataforepidemiologicalresearchaconceptualframeworkfromregistrationtoanalyses
AT leylandalastairh biasesarisingfromlinkedadministrativedataforepidemiologicalresearchaconceptualframeworkfromregistrationtoanalyses
AT barretomauriciol biasesarisingfromlinkedadministrativedataforepidemiologicalresearchaconceptualframeworkfromregistrationtoanalyses
AT katikireddisrinivasavittal biasesarisingfromlinkedadministrativedataforepidemiologicalresearchaconceptualframeworkfromregistrationtoanalyses