Cargando…

Optimized Identification of Advanced Chronic Kidney Disease and Absence of Kidney Disease by Combining Different Electronic Health Data Resources and by Applying Machine Learning Strategies

Automated identification of advanced chronic kidney disease (CKD ≥ III) and of no known kidney disease (NKD) can support both clinicians and researchers. We hypothesized that identification of CKD and NKD can be improved, by combining information from different electronic health record (EHR) resourc...

Descripción completa

Detalles Bibliográficos
Autores principales: Weber, Christoph, Röschke, Lena, Modersohn, Luise, Lohr, Christina, Kolditz, Tobias, Hahn, Udo, Ammon, Danny, Betz, Boris, Kiehntopf, Michael
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7563476/
https://www.ncbi.nlm.nih.gov/pubmed/32932685
http://dx.doi.org/10.3390/jcm9092955
_version_ 1783595497817112576
author Weber, Christoph
Röschke, Lena
Modersohn, Luise
Lohr, Christina
Kolditz, Tobias
Hahn, Udo
Ammon, Danny
Betz, Boris
Kiehntopf, Michael
author_facet Weber, Christoph
Röschke, Lena
Modersohn, Luise
Lohr, Christina
Kolditz, Tobias
Hahn, Udo
Ammon, Danny
Betz, Boris
Kiehntopf, Michael
author_sort Weber, Christoph
collection PubMed
description Automated identification of advanced chronic kidney disease (CKD ≥ III) and of no known kidney disease (NKD) can support both clinicians and researchers. We hypothesized that identification of CKD and NKD can be improved, by combining information from different electronic health record (EHR) resources, comprising laboratory values, discharge summaries and ICD-10 billing codes, compared to using each component alone. We included EHRs from 785 elderly multimorbid patients, hospitalized between 2010 and 2015, that were divided into a training and a test (n = 156) dataset. We used both the area under the receiver operating characteristic (AUROC) and under the precision-recall curve (AUCPR) with a 95% confidence interval for evaluation of different classification models. In the test dataset, the combination of EHR components as a simple classifier identified CKD ≥ III (AUROC 0.96[0.93–0.98]) and NKD (AUROC 0.94[0.91–0.97]) better than laboratory values (AUROC CKD 0.85[0.79–0.90], NKD 0.91[0.87–0.94]), discharge summaries (AUROC CKD 0.87[0.82–0.92], NKD 0.84[0.79–0.89]) or ICD-10 billing codes (AUROC CKD 0.85[0.80–0.91], NKD 0.77[0.72–0.83]) alone. Logistic regression and machine learning models improved recognition of CKD ≥ III compared to the simple classifier if only laboratory values were used (AUROC 0.96[0.92–0.99] vs. 0.86[0.81–0.91], p < 0.05) and improved recognition of NKD if information from previous hospital stays was used (AUROC 0.99[0.98–1.00] vs. 0.95[0.92–0.97]], p < 0.05). Depending on the availability of data, correct automated identification of CKD ≥ III and NKD from EHRs can be improved by generating classification models based on the combination of different EHR components.
format Online
Article
Text
id pubmed-7563476
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75634762020-10-27 Optimized Identification of Advanced Chronic Kidney Disease and Absence of Kidney Disease by Combining Different Electronic Health Data Resources and by Applying Machine Learning Strategies Weber, Christoph Röschke, Lena Modersohn, Luise Lohr, Christina Kolditz, Tobias Hahn, Udo Ammon, Danny Betz, Boris Kiehntopf, Michael J Clin Med Article Automated identification of advanced chronic kidney disease (CKD ≥ III) and of no known kidney disease (NKD) can support both clinicians and researchers. We hypothesized that identification of CKD and NKD can be improved, by combining information from different electronic health record (EHR) resources, comprising laboratory values, discharge summaries and ICD-10 billing codes, compared to using each component alone. We included EHRs from 785 elderly multimorbid patients, hospitalized between 2010 and 2015, that were divided into a training and a test (n = 156) dataset. We used both the area under the receiver operating characteristic (AUROC) and under the precision-recall curve (AUCPR) with a 95% confidence interval for evaluation of different classification models. In the test dataset, the combination of EHR components as a simple classifier identified CKD ≥ III (AUROC 0.96[0.93–0.98]) and NKD (AUROC 0.94[0.91–0.97]) better than laboratory values (AUROC CKD 0.85[0.79–0.90], NKD 0.91[0.87–0.94]), discharge summaries (AUROC CKD 0.87[0.82–0.92], NKD 0.84[0.79–0.89]) or ICD-10 billing codes (AUROC CKD 0.85[0.80–0.91], NKD 0.77[0.72–0.83]) alone. Logistic regression and machine learning models improved recognition of CKD ≥ III compared to the simple classifier if only laboratory values were used (AUROC 0.96[0.92–0.99] vs. 0.86[0.81–0.91], p < 0.05) and improved recognition of NKD if information from previous hospital stays was used (AUROC 0.99[0.98–1.00] vs. 0.95[0.92–0.97]], p < 0.05). Depending on the availability of data, correct automated identification of CKD ≥ III and NKD from EHRs can be improved by generating classification models based on the combination of different EHR components. MDPI 2020-09-12 /pmc/articles/PMC7563476/ /pubmed/32932685 http://dx.doi.org/10.3390/jcm9092955 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Weber, Christoph
Röschke, Lena
Modersohn, Luise
Lohr, Christina
Kolditz, Tobias
Hahn, Udo
Ammon, Danny
Betz, Boris
Kiehntopf, Michael
Optimized Identification of Advanced Chronic Kidney Disease and Absence of Kidney Disease by Combining Different Electronic Health Data Resources and by Applying Machine Learning Strategies
title Optimized Identification of Advanced Chronic Kidney Disease and Absence of Kidney Disease by Combining Different Electronic Health Data Resources and by Applying Machine Learning Strategies
title_full Optimized Identification of Advanced Chronic Kidney Disease and Absence of Kidney Disease by Combining Different Electronic Health Data Resources and by Applying Machine Learning Strategies
title_fullStr Optimized Identification of Advanced Chronic Kidney Disease and Absence of Kidney Disease by Combining Different Electronic Health Data Resources and by Applying Machine Learning Strategies
title_full_unstemmed Optimized Identification of Advanced Chronic Kidney Disease and Absence of Kidney Disease by Combining Different Electronic Health Data Resources and by Applying Machine Learning Strategies
title_short Optimized Identification of Advanced Chronic Kidney Disease and Absence of Kidney Disease by Combining Different Electronic Health Data Resources and by Applying Machine Learning Strategies
title_sort optimized identification of advanced chronic kidney disease and absence of kidney disease by combining different electronic health data resources and by applying machine learning strategies
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7563476/
https://www.ncbi.nlm.nih.gov/pubmed/32932685
http://dx.doi.org/10.3390/jcm9092955
work_keys_str_mv AT weberchristoph optimizedidentificationofadvancedchronickidneydiseaseandabsenceofkidneydiseasebycombiningdifferentelectronichealthdataresourcesandbyapplyingmachinelearningstrategies
AT roschkelena optimizedidentificationofadvancedchronickidneydiseaseandabsenceofkidneydiseasebycombiningdifferentelectronichealthdataresourcesandbyapplyingmachinelearningstrategies
AT modersohnluise optimizedidentificationofadvancedchronickidneydiseaseandabsenceofkidneydiseasebycombiningdifferentelectronichealthdataresourcesandbyapplyingmachinelearningstrategies
AT lohrchristina optimizedidentificationofadvancedchronickidneydiseaseandabsenceofkidneydiseasebycombiningdifferentelectronichealthdataresourcesandbyapplyingmachinelearningstrategies
AT kolditztobias optimizedidentificationofadvancedchronickidneydiseaseandabsenceofkidneydiseasebycombiningdifferentelectronichealthdataresourcesandbyapplyingmachinelearningstrategies
AT hahnudo optimizedidentificationofadvancedchronickidneydiseaseandabsenceofkidneydiseasebycombiningdifferentelectronichealthdataresourcesandbyapplyingmachinelearningstrategies
AT ammondanny optimizedidentificationofadvancedchronickidneydiseaseandabsenceofkidneydiseasebycombiningdifferentelectronichealthdataresourcesandbyapplyingmachinelearningstrategies
AT betzboris optimizedidentificationofadvancedchronickidneydiseaseandabsenceofkidneydiseasebycombiningdifferentelectronichealthdataresourcesandbyapplyingmachinelearningstrategies
AT kiehntopfmichael optimizedidentificationofadvancedchronickidneydiseaseandabsenceofkidneydiseasebycombiningdifferentelectronichealthdataresourcesandbyapplyingmachinelearningstrategies