Cargando…
Uncovering interpretable potential confounders in electronic medical records
Randomized clinical trials (RCT) are the gold standard for informing treatment decisions. Observational studies are often plagued by selection bias, and expert-selected covariates may insufficiently adjust for confounding. We explore how unstructured clinical text can be used to reduce selection bia...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8866497/ https://www.ncbi.nlm.nih.gov/pubmed/35197467 http://dx.doi.org/10.1038/s41467-022-28546-8 |
_version_ | 1784655852664258560 |
---|---|
author | Zeng, Jiaming Gensheimer, Michael F. Rubin, Daniel L. Athey, Susan Shachter, Ross D. |
author_facet | Zeng, Jiaming Gensheimer, Michael F. Rubin, Daniel L. Athey, Susan Shachter, Ross D. |
author_sort | Zeng, Jiaming |
collection | PubMed |
description | Randomized clinical trials (RCT) are the gold standard for informing treatment decisions. Observational studies are often plagued by selection bias, and expert-selected covariates may insufficiently adjust for confounding. We explore how unstructured clinical text can be used to reduce selection bias and improve medical practice. We develop a framework based on natural language processing to uncover interpretable potential confounders from text. We validate our method by comparing the estimated hazard ratio (HR) with and without the confounders against established RCTs. We apply our method to four cohorts built from localized prostate and lung cancer datasets from the Stanford Cancer Institute and show that our method shifts the HR estimate towards the RCT results. The uncovered terms can also be interpreted by oncologists for clinical insights. We present this proof-of-concept study to enable more credible causal inference using observational data, uncover meaningful insights from clinical text, and inform high-stakes medical decisions. |
format | Online Article Text |
id | pubmed-8866497 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-88664972022-03-17 Uncovering interpretable potential confounders in electronic medical records Zeng, Jiaming Gensheimer, Michael F. Rubin, Daniel L. Athey, Susan Shachter, Ross D. Nat Commun Article Randomized clinical trials (RCT) are the gold standard for informing treatment decisions. Observational studies are often plagued by selection bias, and expert-selected covariates may insufficiently adjust for confounding. We explore how unstructured clinical text can be used to reduce selection bias and improve medical practice. We develop a framework based on natural language processing to uncover interpretable potential confounders from text. We validate our method by comparing the estimated hazard ratio (HR) with and without the confounders against established RCTs. We apply our method to four cohorts built from localized prostate and lung cancer datasets from the Stanford Cancer Institute and show that our method shifts the HR estimate towards the RCT results. The uncovered terms can also be interpreted by oncologists for clinical insights. We present this proof-of-concept study to enable more credible causal inference using observational data, uncover meaningful insights from clinical text, and inform high-stakes medical decisions. Nature Publishing Group UK 2022-02-23 /pmc/articles/PMC8866497/ /pubmed/35197467 http://dx.doi.org/10.1038/s41467-022-28546-8 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Zeng, Jiaming Gensheimer, Michael F. Rubin, Daniel L. Athey, Susan Shachter, Ross D. Uncovering interpretable potential confounders in electronic medical records |
title | Uncovering interpretable potential confounders in electronic medical records |
title_full | Uncovering interpretable potential confounders in electronic medical records |
title_fullStr | Uncovering interpretable potential confounders in electronic medical records |
title_full_unstemmed | Uncovering interpretable potential confounders in electronic medical records |
title_short | Uncovering interpretable potential confounders in electronic medical records |
title_sort | uncovering interpretable potential confounders in electronic medical records |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8866497/ https://www.ncbi.nlm.nih.gov/pubmed/35197467 http://dx.doi.org/10.1038/s41467-022-28546-8 |
work_keys_str_mv | AT zengjiaming uncoveringinterpretablepotentialconfoundersinelectronicmedicalrecords AT gensheimermichaelf uncoveringinterpretablepotentialconfoundersinelectronicmedicalrecords AT rubindaniell uncoveringinterpretablepotentialconfoundersinelectronicmedicalrecords AT atheysusan uncoveringinterpretablepotentialconfoundersinelectronicmedicalrecords AT shachterrossd uncoveringinterpretablepotentialconfoundersinelectronicmedicalrecords |