
Long-term survival and second malignant tumor prediction in pediatric, adolescent, and young adult cancer survivors using Random Survival Forests: a SEER analysis

Survival and second malignancy prediction models can aid clinical decision making. Most commonly, survival analysis studies are performed using traditional proportional hazards models, which require strong assumptions and can lead to biased estimates if violated. Therefore, this study aims to implem...


Bibliographic Details
Main Authors: Zhang, Ivy Y., Hart, Gregory R., Qin, Bo, Deng, Jun
Format: Online Article Text
Language: English
Published: Nature Publishing Group UK 2023
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9894907/
https://www.ncbi.nlm.nih.gov/pubmed/36732358
http://dx.doi.org/10.1038/s41598-023-29167-x
author Zhang, Ivy Y.
Hart, Gregory R.
Qin, Bo
Deng, Jun
collection PubMed
description Survival and second malignancy prediction models can aid clinical decision making. Survival analyses most commonly rely on traditional proportional hazards models, which require strong assumptions and can yield biased estimates when those assumptions are violated. This study therefore implements an alternative machine learning (ML) model for survival analysis: the Random Survival Forest (RSF). RSFs were built using U.S. Surveillance, Epidemiology, and End Results (SEER) data to (1) predict 30-year survival in pediatric, adolescent, and young adult cancer survivors; and (2) predict the risk and site of a second tumor within 30 years of the first tumor diagnosis in these age groups. The final RSF models for pediatric, adolescent, and young adult survival have average concordance indices (C-index) of 92.9%, 94.2%, and 94.4% and average time-dependent areas under the receiver operating characteristic curve (AUC) at 30 years since first diagnosis of 90.8%, 93.6%, and 96.1%, respectively. The final RSF models for pediatric, adolescent, and young adult second malignancy have average C-indices of 86.8%, 85.2%, and 88.6% and average time-dependent AUCs at 30 years since first diagnosis of 76.5%, 88.1%, and 99.0%, respectively. These results suggest that ML models are robust and may have clinical value in alleviating physician burden by quickly identifying the highest-risk individuals.
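The C-index reported in the description measures how well a model ranks subjects by risk. As a rough illustration only, the sketch below computes a basic Harrell-style concordance index in plain Python. It is not the authors' implementation: the function name `harrell_c_index` and the toy data are hypothetical, tied event times are ignored for simplicity, and the study may have used a different estimator or library.

```python
def harrell_c_index(times, events, risks):
    """Basic Harrell-style concordance index (illustrative sketch).

    times  -- observed follow-up time per subject
    events -- 1 if the event was observed, 0 if the subject was censored
    risks  -- predicted risk score per subject (higher = expected earlier event)
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        # A pair is comparable only if the earlier subject had an observed event.
        if not events[i]:
            continue
        for j in range(n):
            if times[i] < times[j]:
                comparable += 1
                if risks[i] > risks[j]:
                    concordant += 1.0   # earlier event, higher predicted risk
                elif risks[i] == risks[j]:
                    concordant += 0.5   # tied predictions count as half
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Hypothetical toy data: four subjects, the third one censored at time 6.
toy_times = [2, 4, 6, 8]
toy_events = [1, 1, 0, 1]
toy_risks = [0.9, 0.4, 0.5, 0.1]
toy_c = harrell_c_index(toy_times, toy_events, toy_risks)
print(f"C-index: {toy_c:.2f}")
```

In this toy example five pairs are comparable and four of them are ranked correctly, so the score is 0.8. A value of 0.5 corresponds to random ranking and 1.0 to perfect ranking, which is the scale on which the percentages in the description should be read.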
format Online
Article
Text
id pubmed-9894907
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-9894907 2023-02-04 Long-term survival and second malignant tumor prediction in pediatric, adolescent, and young adult cancer survivors using Random Survival Forests: a SEER analysis Zhang, Ivy Y. Hart, Gregory R. Qin, Bo Deng, Jun Sci Rep Article
Nature Publishing Group UK 2023-02-02 /pmc/articles/PMC9894907/ /pubmed/36732358 http://dx.doi.org/10.1038/s41598-023-29167-x Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/ Open Access: This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit https://creativecommons.org/licenses/by/4.0/.
title Long-term survival and second malignant tumor prediction in pediatric, adolescent, and young adult cancer survivors using Random Survival Forests: a SEER analysis
topic Article