Cargando…

Hidden in plain sight: bias towards sick patients when sampling patients with sufficient electronic health record data for research

BACKGROUND: To demonstrate that subject selection based on sufficient laboratory results and medication orders in electronic health records can be biased towards sick patients. METHODS: Using electronic health record data from 10,000 patients who received anesthetic services at a major metropolitan...

Descripción completa

Detalles Bibliográficos
Autores principales: Rusanov, Alexander, Weiskopf, Nicole G, Wang, Shuang, Weng, Chunhua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4062889/
https://www.ncbi.nlm.nih.gov/pubmed/24916006
http://dx.doi.org/10.1186/1472-6947-14-51
_version_ 1782321706599186432
author Rusanov, Alexander
Weiskopf, Nicole G
Wang, Shuang
Weng, Chunhua
author_facet Rusanov, Alexander
Weiskopf, Nicole G
Wang, Shuang
Weng, Chunhua
author_sort Rusanov, Alexander
collection PubMed
description BACKGROUND: To demonstrate that subject selection based on sufficient laboratory results and medication orders in electronic health records can be biased towards sick patients. METHODS: Using electronic health record data from 10,000 patients who received anesthetic services at a major metropolitan tertiary care academic medical center, an affiliated hospital for women and children, and an affiliated urban primary care hospital, the correlation between patient health status and counts of days with laboratory results or medication orders, as indicated by the American Society of Anesthesiologists Physical Status Classification (ASA Class), was assessed with a Negative Binomial Regression model. RESULTS: Higher ASA Class was associated with more points of data: compared to ASA Class 1 patients, ASA Class 4 patients had 5.05 times the number of days with laboratory results and 6.85 times the number of days with medication orders, controlling for age, sex, emergency status, admission type, primary diagnosis, and procedure. CONCLUSIONS: Imposing data sufficiency requirements for subject selection allows researchers to minimize missing data when reusing electronic health records for research, but introduces a bias towards the selection of sicker patients. We demonstrated the relationship between patient health and quantity of data, which may result in a systematic bias towards the selection of sicker patients for research studies and limit the external validity of research conducted using electronic health record data. Additionally, we discovered other variables (i.e., admission status, age, emergency classification, procedure, and diagnosis) that independently affect data sufficiency.
format Online
Article
Text
id pubmed-4062889
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40628892014-06-20 Hidden in plain sight: bias towards sick patients when sampling patients with sufficient electronic health record data for research Rusanov, Alexander Weiskopf, Nicole G Wang, Shuang Weng, Chunhua BMC Med Inform Decis Mak Research Article BACKGROUND: To demonstrate that subject selection based on sufficient laboratory results and medication orders in electronic health records can be biased towards sick patients. METHODS: Using electronic health record data from 10,000 patients who received anesthetic services at a major metropolitan tertiary care academic medical center, an affiliated hospital for women and children, and an affiliated urban primary care hospital, the correlation between patient health status and counts of days with laboratory results or medication orders, as indicated by the American Society of Anesthesiologists Physical Status Classification (ASA Class), was assessed with a Negative Binomial Regression model. RESULTS: Higher ASA Class was associated with more points of data: compared to ASA Class 1 patients, ASA Class 4 patients had 5.05 times the number of days with laboratory results and 6.85 times the number of days with medication orders, controlling for age, sex, emergency status, admission type, primary diagnosis, and procedure. CONCLUSIONS: Imposing data sufficiency requirements for subject selection allows researchers to minimize missing data when reusing electronic health records for research, but introduces a bias towards the selection of sicker patients. We demonstrated the relationship between patient health and quantity of data, which may result in a systematic bias towards the selection of sicker patients for research studies and limit the external validity of research conducted using electronic health record data. Additionally, we discovered other variables (i.e., admission status, age, emergency classification, procedure, and diagnosis) that independently affect data sufficiency. BioMed Central 2014-06-11 /pmc/articles/PMC4062889/ /pubmed/24916006 http://dx.doi.org/10.1186/1472-6947-14-51 Text en Copyright © 2014 Rusanov et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Rusanov, Alexander
Weiskopf, Nicole G
Wang, Shuang
Weng, Chunhua
Hidden in plain sight: bias towards sick patients when sampling patients with sufficient electronic health record data for research
title Hidden in plain sight: bias towards sick patients when sampling patients with sufficient electronic health record data for research
title_full Hidden in plain sight: bias towards sick patients when sampling patients with sufficient electronic health record data for research
title_fullStr Hidden in plain sight: bias towards sick patients when sampling patients with sufficient electronic health record data for research
title_full_unstemmed Hidden in plain sight: bias towards sick patients when sampling patients with sufficient electronic health record data for research
title_short Hidden in plain sight: bias towards sick patients when sampling patients with sufficient electronic health record data for research
title_sort hidden in plain sight: bias towards sick patients when sampling patients with sufficient electronic health record data for research
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4062889/
https://www.ncbi.nlm.nih.gov/pubmed/24916006
http://dx.doi.org/10.1186/1472-6947-14-51
work_keys_str_mv AT rusanovalexander hiddeninplainsightbiastowardssickpatientswhensamplingpatientswithsufficientelectronichealthrecorddataforresearch
AT weiskopfnicoleg hiddeninplainsightbiastowardssickpatientswhensamplingpatientswithsufficientelectronichealthrecorddataforresearch
AT wangshuang hiddeninplainsightbiastowardssickpatientswhensamplingpatientswithsufficientelectronichealthrecorddataforresearch
AT wengchunhua hiddeninplainsightbiastowardssickpatientswhensamplingpatientswithsufficientelectronichealthrecorddataforresearch