Cargando…

Patient-Specific Predictive Modeling Using Random Forests: An Observational Study for the Critically Ill

BACKGROUND: With a large-scale electronic health record repository, it is feasible to build a customized patient outcome prediction model specifically for a given patient. This approach involves identifying past patients who are similar to the present patient and using their data to train a personal...

Descripción completa

Detalles Bibliográficos
Autor principal: Lee, Joon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5285604/
https://www.ncbi.nlm.nih.gov/pubmed/28096065
http://dx.doi.org/10.2196/medinform.6690
_version_ 1782503861907357696
author Lee, Joon
author_facet Lee, Joon
author_sort Lee, Joon
collection PubMed
description BACKGROUND: With a large-scale electronic health record repository, it is feasible to build a customized patient outcome prediction model specifically for a given patient. This approach involves identifying past patients who are similar to the present patient and using their data to train a personalized predictive model. Our previous work investigated a cosine-similarity patient similarity metric (PSM) for such patient-specific predictive modeling. OBJECTIVE: The objective of the study is to investigate the random forest (RF) proximity measure as a PSM in the context of personalized mortality prediction for intensive care unit (ICU) patients. METHODS: A total of 17,152 ICU admissions were extracted from the Multiparameter Intelligent Monitoring in Intensive Care II database. A number of predictor variables were extracted from the first 24 hours in the ICU. Outcome to be predicted was 30-day mortality. A patient-specific predictive model was trained for each ICU admission using an RF PSM inspired by the RF proximity measure. Death counting, logistic regression, decision tree, and RF models were studied with a hard threshold applied to RF PSM values to only include the M most similar patients in model training, where M was varied. In addition, case-specific random forests (CSRFs), which uses RF proximity for weighted bootstrapping, were trained. RESULTS: Compared to our previous study that investigated a cosine similarity PSM, the RF PSM resulted in superior or comparable predictive performance. RF and CSRF exhibited the best performances (in terms of mean area under the receiver operating characteristic curve [95% confidence interval], RF: 0.839 [0.835-0.844]; CSRF: 0.832 [0.821-0.843]). RF and CSRF did not benefit from personalization via the use of the RF PSM, while the other models did. CONCLUSIONS: The RF PSM led to good mortality prediction performance for several predictive models, although it failed to induce improved performance in RF and CSRF. The distinction between predictor and similarity variables is an important issue arising from the present study. RFs present a promising method for patient-specific outcome prediction.
format Online
Article
Text
id pubmed-5285604
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-52856042017-02-15 Patient-Specific Predictive Modeling Using Random Forests: An Observational Study for the Critically Ill Lee, Joon JMIR Med Inform Original Paper BACKGROUND: With a large-scale electronic health record repository, it is feasible to build a customized patient outcome prediction model specifically for a given patient. This approach involves identifying past patients who are similar to the present patient and using their data to train a personalized predictive model. Our previous work investigated a cosine-similarity patient similarity metric (PSM) for such patient-specific predictive modeling. OBJECTIVE: The objective of the study is to investigate the random forest (RF) proximity measure as a PSM in the context of personalized mortality prediction for intensive care unit (ICU) patients. METHODS: A total of 17,152 ICU admissions were extracted from the Multiparameter Intelligent Monitoring in Intensive Care II database. A number of predictor variables were extracted from the first 24 hours in the ICU. Outcome to be predicted was 30-day mortality. A patient-specific predictive model was trained for each ICU admission using an RF PSM inspired by the RF proximity measure. Death counting, logistic regression, decision tree, and RF models were studied with a hard threshold applied to RF PSM values to only include the M most similar patients in model training, where M was varied. In addition, case-specific random forests (CSRFs), which uses RF proximity for weighted bootstrapping, were trained. RESULTS: Compared to our previous study that investigated a cosine similarity PSM, the RF PSM resulted in superior or comparable predictive performance. RF and CSRF exhibited the best performances (in terms of mean area under the receiver operating characteristic curve [95% confidence interval], RF: 0.839 [0.835-0.844]; CSRF: 0.832 [0.821-0.843]). RF and CSRF did not benefit from personalization via the use of the RF PSM, while the other models did. CONCLUSIONS: The RF PSM led to good mortality prediction performance for several predictive models, although it failed to induce improved performance in RF and CSRF. The distinction between predictor and similarity variables is an important issue arising from the present study. RFs present a promising method for patient-specific outcome prediction. JMIR Publications 2017-01-17 /pmc/articles/PMC5285604/ /pubmed/28096065 http://dx.doi.org/10.2196/medinform.6690 Text en ©Joon Lee. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 17.01.2017. http://creativecommons.org/licenses/by/2.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Lee, Joon
Patient-Specific Predictive Modeling Using Random Forests: An Observational Study for the Critically Ill
title Patient-Specific Predictive Modeling Using Random Forests: An Observational Study for the Critically Ill
title_full Patient-Specific Predictive Modeling Using Random Forests: An Observational Study for the Critically Ill
title_fullStr Patient-Specific Predictive Modeling Using Random Forests: An Observational Study for the Critically Ill
title_full_unstemmed Patient-Specific Predictive Modeling Using Random Forests: An Observational Study for the Critically Ill
title_short Patient-Specific Predictive Modeling Using Random Forests: An Observational Study for the Critically Ill
title_sort patient-specific predictive modeling using random forests: an observational study for the critically ill
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5285604/
https://www.ncbi.nlm.nih.gov/pubmed/28096065
http://dx.doi.org/10.2196/medinform.6690
work_keys_str_mv AT leejoon patientspecificpredictivemodelingusingrandomforestsanobservationalstudyforthecriticallyill