Cargando…

Development of a “meta-model” to address missing data, predict patient-specific cancer survival and provide a foundation for clinical decision support

OBJECTIVE: Like most real-world data, electronic health record (EHR)–derived data from oncology patients typically exhibits wide interpatient variability in terms of available data elements. This interpatient variability leads to missing data and can present critical challenges in developing and imp...

Descripción completa

Detalles Bibliográficos
Autores principales: Baron, Jason M, Paranjape, Ketan, Love, Tara, Sharma, Vishakha, Heaney, Denise, Prime, Matthew
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7936528/
https://www.ncbi.nlm.nih.gov/pubmed/33260202
http://dx.doi.org/10.1093/jamia/ocaa254
_version_ 1783661204901724160
author Baron, Jason M
Paranjape, Ketan
Love, Tara
Sharma, Vishakha
Heaney, Denise
Prime, Matthew
author_facet Baron, Jason M
Paranjape, Ketan
Love, Tara
Sharma, Vishakha
Heaney, Denise
Prime, Matthew
author_sort Baron, Jason M
collection PubMed
description OBJECTIVE: Like most real-world data, electronic health record (EHR)–derived data from oncology patients typically exhibits wide interpatient variability in terms of available data elements. This interpatient variability leads to missing data and can present critical challenges in developing and implementing predictive models to underlie clinical decision support for patient-specific oncology care. Here, we sought to develop a novel ensemble approach to addressing missing data that we term the “meta-model” and apply the meta-model to patient-specific cancer prognosis. MATERIALS AND METHODS: Using real-world data, we developed a suite of individual random survival forest models to predict survival in patients with advanced lung cancer, colorectal cancer, and breast cancer. Individual models varied by the predictor data used. We combined models for each cancer type into a meta-model that predicted survival for each patient using a weighted mean of the individual models for which the patient had all requisite predictors. RESULTS: The meta-model significantly outperformed many of the individual models and performed similarly to the best performing individual models. Comparisons of the meta-model to a more traditional imputation-based method of addressing missing data supported the meta-model’s utility. CONCLUSIONS: We developed a novel machine learning–based strategy to underlie clinical decision support and predict survival in cancer patients, despite missing data. The meta-model may more generally provide a tool for addressing missing data across a variety of clinical prediction problems. Moreover, the meta-model may address other challenges in clinical predictive modeling including model extensibility and integration of predictive algorithms trained across different institutions and datasets.
format Online
Article
Text
id pubmed-7936528
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-79365282021-03-10 Development of a “meta-model” to address missing data, predict patient-specific cancer survival and provide a foundation for clinical decision support Baron, Jason M Paranjape, Ketan Love, Tara Sharma, Vishakha Heaney, Denise Prime, Matthew J Am Med Inform Assoc Research and Applications OBJECTIVE: Like most real-world data, electronic health record (EHR)–derived data from oncology patients typically exhibits wide interpatient variability in terms of available data elements. This interpatient variability leads to missing data and can present critical challenges in developing and implementing predictive models to underlie clinical decision support for patient-specific oncology care. Here, we sought to develop a novel ensemble approach to addressing missing data that we term the “meta-model” and apply the meta-model to patient-specific cancer prognosis. MATERIALS AND METHODS: Using real-world data, we developed a suite of individual random survival forest models to predict survival in patients with advanced lung cancer, colorectal cancer, and breast cancer. Individual models varied by the predictor data used. We combined models for each cancer type into a meta-model that predicted survival for each patient using a weighted mean of the individual models for which the patient had all requisite predictors. RESULTS: The meta-model significantly outperformed many of the individual models and performed similarly to the best performing individual models. Comparisons of the meta-model to a more traditional imputation-based method of addressing missing data supported the meta-model’s utility. CONCLUSIONS: We developed a novel machine learning–based strategy to underlie clinical decision support and predict survival in cancer patients, despite missing data. The meta-model may more generally provide a tool for addressing missing data across a variety of clinical prediction problems. Moreover, the meta-model may address other challenges in clinical predictive modeling including model extensibility and integration of predictive algorithms trained across different institutions and datasets. Oxford University Press 2020-12-01 /pmc/articles/PMC7936528/ /pubmed/33260202 http://dx.doi.org/10.1093/jamia/ocaa254 Text en © The Author(s) 2020. Published by Oxford University Press on behalf of the American Medical Informatics Association. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Research and Applications
Baron, Jason M
Paranjape, Ketan
Love, Tara
Sharma, Vishakha
Heaney, Denise
Prime, Matthew
Development of a “meta-model” to address missing data, predict patient-specific cancer survival and provide a foundation for clinical decision support
title Development of a “meta-model” to address missing data, predict patient-specific cancer survival and provide a foundation for clinical decision support
title_full Development of a “meta-model” to address missing data, predict patient-specific cancer survival and provide a foundation for clinical decision support
title_fullStr Development of a “meta-model” to address missing data, predict patient-specific cancer survival and provide a foundation for clinical decision support
title_full_unstemmed Development of a “meta-model” to address missing data, predict patient-specific cancer survival and provide a foundation for clinical decision support
title_short Development of a “meta-model” to address missing data, predict patient-specific cancer survival and provide a foundation for clinical decision support
title_sort development of a “meta-model” to address missing data, predict patient-specific cancer survival and provide a foundation for clinical decision support
topic Research and Applications
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7936528/
https://www.ncbi.nlm.nih.gov/pubmed/33260202
http://dx.doi.org/10.1093/jamia/ocaa254
work_keys_str_mv AT baronjasonm developmentofametamodeltoaddressmissingdatapredictpatientspecificcancersurvivalandprovideafoundationforclinicaldecisionsupport
AT paranjapeketan developmentofametamodeltoaddressmissingdatapredictpatientspecificcancersurvivalandprovideafoundationforclinicaldecisionsupport
AT lovetara developmentofametamodeltoaddressmissingdatapredictpatientspecificcancersurvivalandprovideafoundationforclinicaldecisionsupport
AT sharmavishakha developmentofametamodeltoaddressmissingdatapredictpatientspecificcancersurvivalandprovideafoundationforclinicaldecisionsupport
AT heaneydenise developmentofametamodeltoaddressmissingdatapredictpatientspecificcancersurvivalandprovideafoundationforclinicaldecisionsupport
AT primematthew developmentofametamodeltoaddressmissingdatapredictpatientspecificcancersurvivalandprovideafoundationforclinicaldecisionsupport