Cargando…

Healthcare utilization is a collider: an introduction to collider bias in EHR data reuse

OBJECTIVES: Collider bias is a common threat to internal validity in clinical research but is rarely mentioned in informatics education or literature. Conditioning on a collider, which is a variable that is the shared causal descendant of an exposure and outcome, may result in spurious associations...

Descripción completa

Detalles Bibliográficos
Autores principales: Weiskopf, Nicole G, Dorr, David A, Jackson, Christie, Lehmann, Harold P, Thompson, Caroline A
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10114115/
https://www.ncbi.nlm.nih.gov/pubmed/36752649
http://dx.doi.org/10.1093/jamia/ocad013
_version_ 1785027965448355840
author Weiskopf, Nicole G
Dorr, David A
Jackson, Christie
Lehmann, Harold P
Thompson, Caroline A
author_facet Weiskopf, Nicole G
Dorr, David A
Jackson, Christie
Lehmann, Harold P
Thompson, Caroline A
author_sort Weiskopf, Nicole G
collection PubMed
description OBJECTIVES: Collider bias is a common threat to internal validity in clinical research but is rarely mentioned in informatics education or literature. Conditioning on a collider, which is a variable that is the shared causal descendant of an exposure and outcome, may result in spurious associations between the exposure and outcome. Our objective is to introduce readers to collider bias and its corollaries in the retrospective analysis of electronic health record (EHR) data. TARGET AUDIENCE: Collider bias is likely to arise in the reuse of EHR data, due to data-generating mechanisms and the nature of healthcare access and utilization in the United States. Therefore, this tutorial is aimed at informaticians and other EHR data consumers without a background in epidemiological methods or causal inference. SCOPE: We focus specifically on problems that may arise from conditioning on forms of healthcare utilization, a common collider that is an implicit selection criterion when one reuses EHR data. Directed acyclic graphs (DAGs) are introduced as a tool for identifying potential sources of bias during study design and planning. References for additional resources on causal inference and DAG construction are provided.
format Online
Article
Text
id pubmed-10114115
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-101141152023-04-20 Healthcare utilization is a collider: an introduction to collider bias in EHR data reuse Weiskopf, Nicole G Dorr, David A Jackson, Christie Lehmann, Harold P Thompson, Caroline A J Am Med Inform Assoc Review OBJECTIVES: Collider bias is a common threat to internal validity in clinical research but is rarely mentioned in informatics education or literature. Conditioning on a collider, which is a variable that is the shared causal descendant of an exposure and outcome, may result in spurious associations between the exposure and outcome. Our objective is to introduce readers to collider bias and its corollaries in the retrospective analysis of electronic health record (EHR) data. TARGET AUDIENCE: Collider bias is likely to arise in the reuse of EHR data, due to data-generating mechanisms and the nature of healthcare access and utilization in the United States. Therefore, this tutorial is aimed at informaticians and other EHR data consumers without a background in epidemiological methods or causal inference. SCOPE: We focus specifically on problems that may arise from conditioning on forms of healthcare utilization, a common collider that is an implicit selection criterion when one reuses EHR data. Directed acyclic graphs (DAGs) are introduced as a tool for identifying potential sources of bias during study design and planning. References for additional resources on causal inference and DAG construction are provided. Oxford University Press 2023-02-08 /pmc/articles/PMC10114115/ /pubmed/36752649 http://dx.doi.org/10.1093/jamia/ocad013 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of the American Medical Informatics Association. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Review
Weiskopf, Nicole G
Dorr, David A
Jackson, Christie
Lehmann, Harold P
Thompson, Caroline A
Healthcare utilization is a collider: an introduction to collider bias in EHR data reuse
title Healthcare utilization is a collider: an introduction to collider bias in EHR data reuse
title_full Healthcare utilization is a collider: an introduction to collider bias in EHR data reuse
title_fullStr Healthcare utilization is a collider: an introduction to collider bias in EHR data reuse
title_full_unstemmed Healthcare utilization is a collider: an introduction to collider bias in EHR data reuse
title_short Healthcare utilization is a collider: an introduction to collider bias in EHR data reuse
title_sort healthcare utilization is a collider: an introduction to collider bias in ehr data reuse
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10114115/
https://www.ncbi.nlm.nih.gov/pubmed/36752649
http://dx.doi.org/10.1093/jamia/ocad013
work_keys_str_mv AT weiskopfnicoleg healthcareutilizationisacollideranintroductiontocolliderbiasinehrdatareuse
AT dorrdavida healthcareutilizationisacollideranintroductiontocolliderbiasinehrdatareuse
AT jacksonchristie healthcareutilizationisacollideranintroductiontocolliderbiasinehrdatareuse
AT lehmannharoldp healthcareutilizationisacollideranintroductiontocolliderbiasinehrdatareuse
AT thompsoncarolinea healthcareutilizationisacollideranintroductiontocolliderbiasinehrdatareuse