Cargando…

Causal risk factor discovery for severe acute kidney injury using electronic health records

BACKGROUND: Acute kidney injury (AKI), characterized by abrupt deterioration of renal function, is a common clinical event among hospitalized patients and it is associated with high morbidity and mortality. AKI is defined in three stages with stage-3 being the most severe phase which is irreversible...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Weiqi, Hu, Yong, Zhang, Xiangzhou, Wu, Lijuan, Liu, Kang, He, Jianqin, Tang, Zilin, Song, Xing, Waitman, Lemuel R., Liu, Mei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5872516/
https://www.ncbi.nlm.nih.gov/pubmed/29589567
http://dx.doi.org/10.1186/s12911-018-0597-7
_version_ 1783309853424353280
author Chen, Weiqi
Hu, Yong
Zhang, Xiangzhou
Wu, Lijuan
Liu, Kang
He, Jianqin
Tang, Zilin
Song, Xing
Waitman, Lemuel R.
Liu, Mei
author_facet Chen, Weiqi
Hu, Yong
Zhang, Xiangzhou
Wu, Lijuan
Liu, Kang
He, Jianqin
Tang, Zilin
Song, Xing
Waitman, Lemuel R.
Liu, Mei
author_sort Chen, Weiqi
collection PubMed
description BACKGROUND: Acute kidney injury (AKI), characterized by abrupt deterioration of renal function, is a common clinical event among hospitalized patients and it is associated with high morbidity and mortality. AKI is defined in three stages with stage-3 being the most severe phase which is irreversible. It is important to effectively discover the true risk factors in order to identify high-risk AKI patients and allow better targeting of tailored interventions. However, Stage-3 AKI patients are very rare (only 0.2% of AKI patients) with a large scale of features available in EHR (1917 potential risk features), yielding a scenario unfeasible for any correlation-based feature selection or modeling method. This study aims to discover the key factors and improve the detection of Stage-3 AKI. METHODS: A causal discovery method (McDSL) is adopted for causal discovery to infer true causal relationship between information buried in EHR (such as medication, diagnosis, laboratory tests, comorbidities and etc.) and Stage-3 AKI risk. The research approach comprised two major phases: data collection, and causal discovery. The first phase is propose to collect the data from HER (includes 358 encounters and 891 risk factors). Finally, McDSL is employed to discover the causal risk factors of Stage-3 AKI, and five well-known machine learning models are built for predicting Stage-3 AKI with 10-fold cross-validation (predictive accuracy were measured by AUC, precision, recall and F-score). RESULTS: McDSL is useful for further research of EHR. It is able to discover four causal features, all selected features are medications that are modifiable. The latest research of machine learning is employed to compare the performance of prediction, and the experimental result has verified the selected features are pivotal. CONCLUSIONS: The features selected by McDSL, which enable us to achieve significant dimension reduction without sacrificing prediction accuracy, suggesting potential clinical use such as helping physicians develop better prevention and treatment strategies.
format Online
Article
Text
id pubmed-5872516
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-58725162018-04-02 Causal risk factor discovery for severe acute kidney injury using electronic health records Chen, Weiqi Hu, Yong Zhang, Xiangzhou Wu, Lijuan Liu, Kang He, Jianqin Tang, Zilin Song, Xing Waitman, Lemuel R. Liu, Mei BMC Med Inform Decis Mak Research BACKGROUND: Acute kidney injury (AKI), characterized by abrupt deterioration of renal function, is a common clinical event among hospitalized patients and it is associated with high morbidity and mortality. AKI is defined in three stages with stage-3 being the most severe phase which is irreversible. It is important to effectively discover the true risk factors in order to identify high-risk AKI patients and allow better targeting of tailored interventions. However, Stage-3 AKI patients are very rare (only 0.2% of AKI patients) with a large scale of features available in EHR (1917 potential risk features), yielding a scenario unfeasible for any correlation-based feature selection or modeling method. This study aims to discover the key factors and improve the detection of Stage-3 AKI. METHODS: A causal discovery method (McDSL) is adopted for causal discovery to infer true causal relationship between information buried in EHR (such as medication, diagnosis, laboratory tests, comorbidities and etc.) and Stage-3 AKI risk. The research approach comprised two major phases: data collection, and causal discovery. The first phase is propose to collect the data from HER (includes 358 encounters and 891 risk factors). Finally, McDSL is employed to discover the causal risk factors of Stage-3 AKI, and five well-known machine learning models are built for predicting Stage-3 AKI with 10-fold cross-validation (predictive accuracy were measured by AUC, precision, recall and F-score). RESULTS: McDSL is useful for further research of EHR. It is able to discover four causal features, all selected features are medications that are modifiable. The latest research of machine learning is employed to compare the performance of prediction, and the experimental result has verified the selected features are pivotal. CONCLUSIONS: The features selected by McDSL, which enable us to achieve significant dimension reduction without sacrificing prediction accuracy, suggesting potential clinical use such as helping physicians develop better prevention and treatment strategies. BioMed Central 2018-03-22 /pmc/articles/PMC5872516/ /pubmed/29589567 http://dx.doi.org/10.1186/s12911-018-0597-7 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Chen, Weiqi
Hu, Yong
Zhang, Xiangzhou
Wu, Lijuan
Liu, Kang
He, Jianqin
Tang, Zilin
Song, Xing
Waitman, Lemuel R.
Liu, Mei
Causal risk factor discovery for severe acute kidney injury using electronic health records
title Causal risk factor discovery for severe acute kidney injury using electronic health records
title_full Causal risk factor discovery for severe acute kidney injury using electronic health records
title_fullStr Causal risk factor discovery for severe acute kidney injury using electronic health records
title_full_unstemmed Causal risk factor discovery for severe acute kidney injury using electronic health records
title_short Causal risk factor discovery for severe acute kidney injury using electronic health records
title_sort causal risk factor discovery for severe acute kidney injury using electronic health records
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5872516/
https://www.ncbi.nlm.nih.gov/pubmed/29589567
http://dx.doi.org/10.1186/s12911-018-0597-7
work_keys_str_mv AT chenweiqi causalriskfactordiscoveryforsevereacutekidneyinjuryusingelectronichealthrecords
AT huyong causalriskfactordiscoveryforsevereacutekidneyinjuryusingelectronichealthrecords
AT zhangxiangzhou causalriskfactordiscoveryforsevereacutekidneyinjuryusingelectronichealthrecords
AT wulijuan causalriskfactordiscoveryforsevereacutekidneyinjuryusingelectronichealthrecords
AT liukang causalriskfactordiscoveryforsevereacutekidneyinjuryusingelectronichealthrecords
AT hejianqin causalriskfactordiscoveryforsevereacutekidneyinjuryusingelectronichealthrecords
AT tangzilin causalriskfactordiscoveryforsevereacutekidneyinjuryusingelectronichealthrecords
AT songxing causalriskfactordiscoveryforsevereacutekidneyinjuryusingelectronichealthrecords
AT waitmanlemuelr causalriskfactordiscoveryforsevereacutekidneyinjuryusingelectronichealthrecords
AT liumei causalriskfactordiscoveryforsevereacutekidneyinjuryusingelectronichealthrecords