Cargando…
Patient-Specific Predictive Modeling Using Random Forests: An Observational Study for the Critically Ill
BACKGROUND: With a large-scale electronic health record repository, it is feasible to build a customized patient outcome prediction model specifically for a given patient. This approach involves identifying past patients who are similar to the present patient and using their data to train a personal...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5285604/ https://www.ncbi.nlm.nih.gov/pubmed/28096065 http://dx.doi.org/10.2196/medinform.6690 |
_version_ | 1782503861907357696 |
---|---|
author | Lee, Joon |
author_facet | Lee, Joon |
author_sort | Lee, Joon |
collection | PubMed |
description | BACKGROUND: With a large-scale electronic health record repository, it is feasible to build a customized patient outcome prediction model specifically for a given patient. This approach involves identifying past patients who are similar to the present patient and using their data to train a personalized predictive model. Our previous work investigated a cosine-similarity patient similarity metric (PSM) for such patient-specific predictive modeling. OBJECTIVE: The objective of the study is to investigate the random forest (RF) proximity measure as a PSM in the context of personalized mortality prediction for intensive care unit (ICU) patients. METHODS: A total of 17,152 ICU admissions were extracted from the Multiparameter Intelligent Monitoring in Intensive Care II database. A number of predictor variables were extracted from the first 24 hours in the ICU. Outcome to be predicted was 30-day mortality. A patient-specific predictive model was trained for each ICU admission using an RF PSM inspired by the RF proximity measure. Death counting, logistic regression, decision tree, and RF models were studied with a hard threshold applied to RF PSM values to only include the M most similar patients in model training, where M was varied. In addition, case-specific random forests (CSRFs), which uses RF proximity for weighted bootstrapping, were trained. RESULTS: Compared to our previous study that investigated a cosine similarity PSM, the RF PSM resulted in superior or comparable predictive performance. RF and CSRF exhibited the best performances (in terms of mean area under the receiver operating characteristic curve [95% confidence interval], RF: 0.839 [0.835-0.844]; CSRF: 0.832 [0.821-0.843]). RF and CSRF did not benefit from personalization via the use of the RF PSM, while the other models did. CONCLUSIONS: The RF PSM led to good mortality prediction performance for several predictive models, although it failed to induce improved performance in RF and CSRF. The distinction between predictor and similarity variables is an important issue arising from the present study. RFs present a promising method for patient-specific outcome prediction. |
format | Online Article Text |
id | pubmed-5285604 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-52856042017-02-15 Patient-Specific Predictive Modeling Using Random Forests: An Observational Study for the Critically Ill Lee, Joon JMIR Med Inform Original Paper BACKGROUND: With a large-scale electronic health record repository, it is feasible to build a customized patient outcome prediction model specifically for a given patient. This approach involves identifying past patients who are similar to the present patient and using their data to train a personalized predictive model. Our previous work investigated a cosine-similarity patient similarity metric (PSM) for such patient-specific predictive modeling. OBJECTIVE: The objective of the study is to investigate the random forest (RF) proximity measure as a PSM in the context of personalized mortality prediction for intensive care unit (ICU) patients. METHODS: A total of 17,152 ICU admissions were extracted from the Multiparameter Intelligent Monitoring in Intensive Care II database. A number of predictor variables were extracted from the first 24 hours in the ICU. Outcome to be predicted was 30-day mortality. A patient-specific predictive model was trained for each ICU admission using an RF PSM inspired by the RF proximity measure. Death counting, logistic regression, decision tree, and RF models were studied with a hard threshold applied to RF PSM values to only include the M most similar patients in model training, where M was varied. In addition, case-specific random forests (CSRFs), which uses RF proximity for weighted bootstrapping, were trained. RESULTS: Compared to our previous study that investigated a cosine similarity PSM, the RF PSM resulted in superior or comparable predictive performance. RF and CSRF exhibited the best performances (in terms of mean area under the receiver operating characteristic curve [95% confidence interval], RF: 0.839 [0.835-0.844]; CSRF: 0.832 [0.821-0.843]). RF and CSRF did not benefit from personalization via the use of the RF PSM, while the other models did. CONCLUSIONS: The RF PSM led to good mortality prediction performance for several predictive models, although it failed to induce improved performance in RF and CSRF. The distinction between predictor and similarity variables is an important issue arising from the present study. RFs present a promising method for patient-specific outcome prediction. JMIR Publications 2017-01-17 /pmc/articles/PMC5285604/ /pubmed/28096065 http://dx.doi.org/10.2196/medinform.6690 Text en ©Joon Lee. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 17.01.2017. http://creativecommons.org/licenses/by/2.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Lee, Joon Patient-Specific Predictive Modeling Using Random Forests: An Observational Study for the Critically Ill |
title | Patient-Specific Predictive Modeling Using Random Forests: An Observational Study for the Critically Ill |
title_full | Patient-Specific Predictive Modeling Using Random Forests: An Observational Study for the Critically Ill |
title_fullStr | Patient-Specific Predictive Modeling Using Random Forests: An Observational Study for the Critically Ill |
title_full_unstemmed | Patient-Specific Predictive Modeling Using Random Forests: An Observational Study for the Critically Ill |
title_short | Patient-Specific Predictive Modeling Using Random Forests: An Observational Study for the Critically Ill |
title_sort | patient-specific predictive modeling using random forests: an observational study for the critically ill |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5285604/ https://www.ncbi.nlm.nih.gov/pubmed/28096065 http://dx.doi.org/10.2196/medinform.6690 |
work_keys_str_mv | AT leejoon patientspecificpredictivemodelingusingrandomforestsanobservationalstudyforthecriticallyill |