Cargando…
Causal risk factor discovery for severe acute kidney injury using electronic health records
BACKGROUND: Acute kidney injury (AKI), characterized by abrupt deterioration of renal function, is a common clinical event among hospitalized patients and it is associated with high morbidity and mortality. AKI is defined in three stages with stage-3 being the most severe phase which is irreversible...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5872516/ https://www.ncbi.nlm.nih.gov/pubmed/29589567 http://dx.doi.org/10.1186/s12911-018-0597-7 |
_version_ | 1783309853424353280 |
---|---|
author | Chen, Weiqi Hu, Yong Zhang, Xiangzhou Wu, Lijuan Liu, Kang He, Jianqin Tang, Zilin Song, Xing Waitman, Lemuel R. Liu, Mei |
author_facet | Chen, Weiqi Hu, Yong Zhang, Xiangzhou Wu, Lijuan Liu, Kang He, Jianqin Tang, Zilin Song, Xing Waitman, Lemuel R. Liu, Mei |
author_sort | Chen, Weiqi |
collection | PubMed |
description | BACKGROUND: Acute kidney injury (AKI), characterized by abrupt deterioration of renal function, is a common clinical event among hospitalized patients and it is associated with high morbidity and mortality. AKI is defined in three stages with stage-3 being the most severe phase which is irreversible. It is important to effectively discover the true risk factors in order to identify high-risk AKI patients and allow better targeting of tailored interventions. However, Stage-3 AKI patients are very rare (only 0.2% of AKI patients) with a large scale of features available in EHR (1917 potential risk features), yielding a scenario unfeasible for any correlation-based feature selection or modeling method. This study aims to discover the key factors and improve the detection of Stage-3 AKI. METHODS: A causal discovery method (McDSL) is adopted for causal discovery to infer true causal relationship between information buried in EHR (such as medication, diagnosis, laboratory tests, comorbidities and etc.) and Stage-3 AKI risk. The research approach comprised two major phases: data collection, and causal discovery. The first phase is propose to collect the data from HER (includes 358 encounters and 891 risk factors). Finally, McDSL is employed to discover the causal risk factors of Stage-3 AKI, and five well-known machine learning models are built for predicting Stage-3 AKI with 10-fold cross-validation (predictive accuracy were measured by AUC, precision, recall and F-score). RESULTS: McDSL is useful for further research of EHR. It is able to discover four causal features, all selected features are medications that are modifiable. The latest research of machine learning is employed to compare the performance of prediction, and the experimental result has verified the selected features are pivotal. CONCLUSIONS: The features selected by McDSL, which enable us to achieve significant dimension reduction without sacrificing prediction accuracy, suggesting potential clinical use such as helping physicians develop better prevention and treatment strategies. |
format | Online Article Text |
id | pubmed-5872516 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-58725162018-04-02 Causal risk factor discovery for severe acute kidney injury using electronic health records Chen, Weiqi Hu, Yong Zhang, Xiangzhou Wu, Lijuan Liu, Kang He, Jianqin Tang, Zilin Song, Xing Waitman, Lemuel R. Liu, Mei BMC Med Inform Decis Mak Research BACKGROUND: Acute kidney injury (AKI), characterized by abrupt deterioration of renal function, is a common clinical event among hospitalized patients and it is associated with high morbidity and mortality. AKI is defined in three stages with stage-3 being the most severe phase which is irreversible. It is important to effectively discover the true risk factors in order to identify high-risk AKI patients and allow better targeting of tailored interventions. However, Stage-3 AKI patients are very rare (only 0.2% of AKI patients) with a large scale of features available in EHR (1917 potential risk features), yielding a scenario unfeasible for any correlation-based feature selection or modeling method. This study aims to discover the key factors and improve the detection of Stage-3 AKI. METHODS: A causal discovery method (McDSL) is adopted for causal discovery to infer true causal relationship between information buried in EHR (such as medication, diagnosis, laboratory tests, comorbidities and etc.) and Stage-3 AKI risk. The research approach comprised two major phases: data collection, and causal discovery. The first phase is propose to collect the data from HER (includes 358 encounters and 891 risk factors). Finally, McDSL is employed to discover the causal risk factors of Stage-3 AKI, and five well-known machine learning models are built for predicting Stage-3 AKI with 10-fold cross-validation (predictive accuracy were measured by AUC, precision, recall and F-score). RESULTS: McDSL is useful for further research of EHR. It is able to discover four causal features, all selected features are medications that are modifiable. The latest research of machine learning is employed to compare the performance of prediction, and the experimental result has verified the selected features are pivotal. CONCLUSIONS: The features selected by McDSL, which enable us to achieve significant dimension reduction without sacrificing prediction accuracy, suggesting potential clinical use such as helping physicians develop better prevention and treatment strategies. BioMed Central 2018-03-22 /pmc/articles/PMC5872516/ /pubmed/29589567 http://dx.doi.org/10.1186/s12911-018-0597-7 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Chen, Weiqi Hu, Yong Zhang, Xiangzhou Wu, Lijuan Liu, Kang He, Jianqin Tang, Zilin Song, Xing Waitman, Lemuel R. Liu, Mei Causal risk factor discovery for severe acute kidney injury using electronic health records |
title | Causal risk factor discovery for severe acute kidney injury using electronic health records |
title_full | Causal risk factor discovery for severe acute kidney injury using electronic health records |
title_fullStr | Causal risk factor discovery for severe acute kidney injury using electronic health records |
title_full_unstemmed | Causal risk factor discovery for severe acute kidney injury using electronic health records |
title_short | Causal risk factor discovery for severe acute kidney injury using electronic health records |
title_sort | causal risk factor discovery for severe acute kidney injury using electronic health records |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5872516/ https://www.ncbi.nlm.nih.gov/pubmed/29589567 http://dx.doi.org/10.1186/s12911-018-0597-7 |
work_keys_str_mv | AT chenweiqi causalriskfactordiscoveryforsevereacutekidneyinjuryusingelectronichealthrecords AT huyong causalriskfactordiscoveryforsevereacutekidneyinjuryusingelectronichealthrecords AT zhangxiangzhou causalriskfactordiscoveryforsevereacutekidneyinjuryusingelectronichealthrecords AT wulijuan causalriskfactordiscoveryforsevereacutekidneyinjuryusingelectronichealthrecords AT liukang causalriskfactordiscoveryforsevereacutekidneyinjuryusingelectronichealthrecords AT hejianqin causalriskfactordiscoveryforsevereacutekidneyinjuryusingelectronichealthrecords AT tangzilin causalriskfactordiscoveryforsevereacutekidneyinjuryusingelectronichealthrecords AT songxing causalriskfactordiscoveryforsevereacutekidneyinjuryusingelectronichealthrecords AT waitmanlemuelr causalriskfactordiscoveryforsevereacutekidneyinjuryusingelectronichealthrecords AT liumei causalriskfactordiscoveryforsevereacutekidneyinjuryusingelectronichealthrecords |