Cargando…

Machine learning algorithm for early detection of end-stage renal disease

BACKGROUND: End stage renal disease (ESRD) describes the most severe stage of chronic kidney disease (CKD), when patients need dialysis or renal transplant. There is often a delay in recognizing, diagnosing, and treating the various etiologies of CKD. The objective of the present study was to employ...

Descripción completa

Detalles Bibliográficos
Autores principales: Segal, Zvi, Kalifa, Dan, Radinsky, Kira, Ehrenberg, Bar, Elad, Guy, Maor, Gal, Lewis, Maor, Tibi, Muhammad, Korn, Liat, Koren, Gideon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7693522/
https://www.ncbi.nlm.nih.gov/pubmed/33246427
http://dx.doi.org/10.1186/s12882-020-02093-0
_version_ 1783614763896406016
author Segal, Zvi
Kalifa, Dan
Radinsky, Kira
Ehrenberg, Bar
Elad, Guy
Maor, Gal
Lewis, Maor
Tibi, Muhammad
Korn, Liat
Koren, Gideon
author_facet Segal, Zvi
Kalifa, Dan
Radinsky, Kira
Ehrenberg, Bar
Elad, Guy
Maor, Gal
Lewis, Maor
Tibi, Muhammad
Korn, Liat
Koren, Gideon
author_sort Segal, Zvi
collection PubMed
description BACKGROUND: End stage renal disease (ESRD) describes the most severe stage of chronic kidney disease (CKD), when patients need dialysis or renal transplant. There is often a delay in recognizing, diagnosing, and treating the various etiologies of CKD. The objective of the present study was to employ machine learning algorithms to develop a prediction model for progression to ESRD based on a large-scale multidimensional database. METHODS: This study analyzed 10,000,000 medical insurance claims from 550,000 patient records using a commercial health insurance database. Inclusion criteria were patients over the age of 18 diagnosed with CKD Stages 1–4. We compiled 240 predictor candidates, divided into six feature groups: demographics, chronic conditions, diagnosis and procedure features, medication features, medical costs, and episode counts. We used a feature embedding method based on implementation of the Word2Vec algorithm to further capture temporal information for the three main components of the data: diagnosis, procedures, and medications. For the analysis, we used the gradient boosting tree algorithm (XGBoost implementation). RESULTS: The C-statistic for the model was 0.93 [(0.916–0.943) 95% confidence interval], with a sensitivity of 0.715 and specificity of 0.958. Positive Predictive Value (PPV) was 0.517, and Negative Predictive Value (NPV) was 0.981. For the top 1 percentile of patients identified by our model, the PPV was 1.0. In addition, for the top 5 percentile of patients identified by our model, the PPV was 0.71. All the results above were tested on the test data only, and the threshold used to obtain these results was 0.1. Notable features contributing to the model were chronic heart and ischemic heart disease as a comorbidity, patient age, and number of hypertensive crisis events. CONCLUSIONS: When a patient is approaching the threshold of ESRD risk, a warning message can be sent electronically to the physician, who will initiate a referral for a nephrology consultation to ensure an investigation to hasten the establishment of a diagnosis and initiate management and therapy when appropriate.
format Online
Article
Text
id pubmed-7693522
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-76935222020-11-30 Machine learning algorithm for early detection of end-stage renal disease Segal, Zvi Kalifa, Dan Radinsky, Kira Ehrenberg, Bar Elad, Guy Maor, Gal Lewis, Maor Tibi, Muhammad Korn, Liat Koren, Gideon BMC Nephrol Research Article BACKGROUND: End stage renal disease (ESRD) describes the most severe stage of chronic kidney disease (CKD), when patients need dialysis or renal transplant. There is often a delay in recognizing, diagnosing, and treating the various etiologies of CKD. The objective of the present study was to employ machine learning algorithms to develop a prediction model for progression to ESRD based on a large-scale multidimensional database. METHODS: This study analyzed 10,000,000 medical insurance claims from 550,000 patient records using a commercial health insurance database. Inclusion criteria were patients over the age of 18 diagnosed with CKD Stages 1–4. We compiled 240 predictor candidates, divided into six feature groups: demographics, chronic conditions, diagnosis and procedure features, medication features, medical costs, and episode counts. We used a feature embedding method based on implementation of the Word2Vec algorithm to further capture temporal information for the three main components of the data: diagnosis, procedures, and medications. For the analysis, we used the gradient boosting tree algorithm (XGBoost implementation). RESULTS: The C-statistic for the model was 0.93 [(0.916–0.943) 95% confidence interval], with a sensitivity of 0.715 and specificity of 0.958. Positive Predictive Value (PPV) was 0.517, and Negative Predictive Value (NPV) was 0.981. For the top 1 percentile of patients identified by our model, the PPV was 1.0. In addition, for the top 5 percentile of patients identified by our model, the PPV was 0.71. All the results above were tested on the test data only, and the threshold used to obtain these results was 0.1. Notable features contributing to the model were chronic heart and ischemic heart disease as a comorbidity, patient age, and number of hypertensive crisis events. CONCLUSIONS: When a patient is approaching the threshold of ESRD risk, a warning message can be sent electronically to the physician, who will initiate a referral for a nephrology consultation to ensure an investigation to hasten the establishment of a diagnosis and initiate management and therapy when appropriate. BioMed Central 2020-11-27 /pmc/articles/PMC7693522/ /pubmed/33246427 http://dx.doi.org/10.1186/s12882-020-02093-0 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Segal, Zvi
Kalifa, Dan
Radinsky, Kira
Ehrenberg, Bar
Elad, Guy
Maor, Gal
Lewis, Maor
Tibi, Muhammad
Korn, Liat
Koren, Gideon
Machine learning algorithm for early detection of end-stage renal disease
title Machine learning algorithm for early detection of end-stage renal disease
title_full Machine learning algorithm for early detection of end-stage renal disease
title_fullStr Machine learning algorithm for early detection of end-stage renal disease
title_full_unstemmed Machine learning algorithm for early detection of end-stage renal disease
title_short Machine learning algorithm for early detection of end-stage renal disease
title_sort machine learning algorithm for early detection of end-stage renal disease
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7693522/
https://www.ncbi.nlm.nih.gov/pubmed/33246427
http://dx.doi.org/10.1186/s12882-020-02093-0
work_keys_str_mv AT segalzvi machinelearningalgorithmforearlydetectionofendstagerenaldisease
AT kalifadan machinelearningalgorithmforearlydetectionofendstagerenaldisease
AT radinskykira machinelearningalgorithmforearlydetectionofendstagerenaldisease
AT ehrenbergbar machinelearningalgorithmforearlydetectionofendstagerenaldisease
AT eladguy machinelearningalgorithmforearlydetectionofendstagerenaldisease
AT maorgal machinelearningalgorithmforearlydetectionofendstagerenaldisease
AT lewismaor machinelearningalgorithmforearlydetectionofendstagerenaldisease
AT tibimuhammad machinelearningalgorithmforearlydetectionofendstagerenaldisease
AT kornliat machinelearningalgorithmforearlydetectionofendstagerenaldisease
AT korengideon machinelearningalgorithmforearlydetectionofendstagerenaldisease