Cargando…
Development of a “meta-model” to address missing data, predict patient-specific cancer survival and provide a foundation for clinical decision support
OBJECTIVE: Like most real-world data, electronic health record (EHR)–derived data from oncology patients typically exhibits wide interpatient variability in terms of available data elements. This interpatient variability leads to missing data and can present critical challenges in developing and imp...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7936528/ https://www.ncbi.nlm.nih.gov/pubmed/33260202 http://dx.doi.org/10.1093/jamia/ocaa254 |
_version_ | 1783661204901724160 |
---|---|
author | Baron, Jason M Paranjape, Ketan Love, Tara Sharma, Vishakha Heaney, Denise Prime, Matthew |
author_facet | Baron, Jason M Paranjape, Ketan Love, Tara Sharma, Vishakha Heaney, Denise Prime, Matthew |
author_sort | Baron, Jason M |
collection | PubMed |
description | OBJECTIVE: Like most real-world data, electronic health record (EHR)–derived data from oncology patients typically exhibits wide interpatient variability in terms of available data elements. This interpatient variability leads to missing data and can present critical challenges in developing and implementing predictive models to underlie clinical decision support for patient-specific oncology care. Here, we sought to develop a novel ensemble approach to addressing missing data that we term the “meta-model” and apply the meta-model to patient-specific cancer prognosis. MATERIALS AND METHODS: Using real-world data, we developed a suite of individual random survival forest models to predict survival in patients with advanced lung cancer, colorectal cancer, and breast cancer. Individual models varied by the predictor data used. We combined models for each cancer type into a meta-model that predicted survival for each patient using a weighted mean of the individual models for which the patient had all requisite predictors. RESULTS: The meta-model significantly outperformed many of the individual models and performed similarly to the best performing individual models. Comparisons of the meta-model to a more traditional imputation-based method of addressing missing data supported the meta-model’s utility. CONCLUSIONS: We developed a novel machine learning–based strategy to underlie clinical decision support and predict survival in cancer patients, despite missing data. The meta-model may more generally provide a tool for addressing missing data across a variety of clinical prediction problems. Moreover, the meta-model may address other challenges in clinical predictive modeling including model extensibility and integration of predictive algorithms trained across different institutions and datasets. |
format | Online Article Text |
id | pubmed-7936528 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-79365282021-03-10 Development of a “meta-model” to address missing data, predict patient-specific cancer survival and provide a foundation for clinical decision support Baron, Jason M Paranjape, Ketan Love, Tara Sharma, Vishakha Heaney, Denise Prime, Matthew J Am Med Inform Assoc Research and Applications OBJECTIVE: Like most real-world data, electronic health record (EHR)–derived data from oncology patients typically exhibits wide interpatient variability in terms of available data elements. This interpatient variability leads to missing data and can present critical challenges in developing and implementing predictive models to underlie clinical decision support for patient-specific oncology care. Here, we sought to develop a novel ensemble approach to addressing missing data that we term the “meta-model” and apply the meta-model to patient-specific cancer prognosis. MATERIALS AND METHODS: Using real-world data, we developed a suite of individual random survival forest models to predict survival in patients with advanced lung cancer, colorectal cancer, and breast cancer. Individual models varied by the predictor data used. We combined models for each cancer type into a meta-model that predicted survival for each patient using a weighted mean of the individual models for which the patient had all requisite predictors. RESULTS: The meta-model significantly outperformed many of the individual models and performed similarly to the best performing individual models. Comparisons of the meta-model to a more traditional imputation-based method of addressing missing data supported the meta-model’s utility. CONCLUSIONS: We developed a novel machine learning–based strategy to underlie clinical decision support and predict survival in cancer patients, despite missing data. The meta-model may more generally provide a tool for addressing missing data across a variety of clinical prediction problems. Moreover, the meta-model may address other challenges in clinical predictive modeling including model extensibility and integration of predictive algorithms trained across different institutions and datasets. Oxford University Press 2020-12-01 /pmc/articles/PMC7936528/ /pubmed/33260202 http://dx.doi.org/10.1093/jamia/ocaa254 Text en © The Author(s) 2020. Published by Oxford University Press on behalf of the American Medical Informatics Association. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Research and Applications Baron, Jason M Paranjape, Ketan Love, Tara Sharma, Vishakha Heaney, Denise Prime, Matthew Development of a “meta-model” to address missing data, predict patient-specific cancer survival and provide a foundation for clinical decision support |
title | Development of a “meta-model” to address missing data, predict patient-specific cancer survival and provide a foundation for clinical decision support |
title_full | Development of a “meta-model” to address missing data, predict patient-specific cancer survival and provide a foundation for clinical decision support |
title_fullStr | Development of a “meta-model” to address missing data, predict patient-specific cancer survival and provide a foundation for clinical decision support |
title_full_unstemmed | Development of a “meta-model” to address missing data, predict patient-specific cancer survival and provide a foundation for clinical decision support |
title_short | Development of a “meta-model” to address missing data, predict patient-specific cancer survival and provide a foundation for clinical decision support |
title_sort | development of a “meta-model” to address missing data, predict patient-specific cancer survival and provide a foundation for clinical decision support |
topic | Research and Applications |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7936528/ https://www.ncbi.nlm.nih.gov/pubmed/33260202 http://dx.doi.org/10.1093/jamia/ocaa254 |
work_keys_str_mv | AT baronjasonm developmentofametamodeltoaddressmissingdatapredictpatientspecificcancersurvivalandprovideafoundationforclinicaldecisionsupport AT paranjapeketan developmentofametamodeltoaddressmissingdatapredictpatientspecificcancersurvivalandprovideafoundationforclinicaldecisionsupport AT lovetara developmentofametamodeltoaddressmissingdatapredictpatientspecificcancersurvivalandprovideafoundationforclinicaldecisionsupport AT sharmavishakha developmentofametamodeltoaddressmissingdatapredictpatientspecificcancersurvivalandprovideafoundationforclinicaldecisionsupport AT heaneydenise developmentofametamodeltoaddressmissingdatapredictpatientspecificcancersurvivalandprovideafoundationforclinicaldecisionsupport AT primematthew developmentofametamodeltoaddressmissingdatapredictpatientspecificcancersurvivalandprovideafoundationforclinicaldecisionsupport |