Cargando…

Learning Individual Survival Models from PanCancer Whole Transcriptome Data

PURPOSE: Personalized medicine attempts to predict survival time for each patient, based on their individual tumor molecular profile. We investigate whether our survival learner in combination with a dimension reduction method can produce useful survival estimates for a variety of patients with canc...

Descripción completa

Detalles Bibliográficos
Autores principales: Kumar, Neeraj, Skubleny, Daniel, Parkes, Michael, Verma, Ruchika, Davis, Sacha, Kumar, Luke, Aissiou, Amira, Greiner, Russell
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Association for Cancer Research 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10543961/
https://www.ncbi.nlm.nih.gov/pubmed/37463063
http://dx.doi.org/10.1158/1078-0432.CCR-22-3493
_version_ 1785114398061232128
author Kumar, Neeraj
Skubleny, Daniel
Parkes, Michael
Verma, Ruchika
Davis, Sacha
Kumar, Luke
Aissiou, Amira
Greiner, Russell
author_facet Kumar, Neeraj
Skubleny, Daniel
Parkes, Michael
Verma, Ruchika
Davis, Sacha
Kumar, Luke
Aissiou, Amira
Greiner, Russell
author_sort Kumar, Neeraj
collection PubMed
description PURPOSE: Personalized medicine attempts to predict survival time for each patient, based on their individual tumor molecular profile. We investigate whether our survival learner in combination with a dimension reduction method can produce useful survival estimates for a variety of patients with cancer. EXPERIMENTAL DESIGN: This article provides a method that learns a model for predicting the survival time for individual patients with cancer from the PanCancer Atlas: given the (16,335 dimensional) gene expression profiles from 10,173 patients, each having one of 33 cancers, this method uses unsupervised nonnegative matrix factorization (NMF) to reexpress the gene expression data for each patient in terms of 100 learned NMF factors. It then feeds these 100 factors into the Multi-Task Logistic Regression (MTLR) learner to produce cancer-specific models for each of 20 cancers (with >50 uncensored instances); this produces “individual survival distributions” (ISD), which provide survival probabilities at each future time for each individual patient, which provides a patient's risk score and estimated survival time. RESULTS: Our NMF-MTLR concordance indices outperformed the VAECox benchmark by 14.9% overall. We achieved optimal survival prediction using pan-cancer NMF in combination with cancer-specific MTLR models. We provide biological interpretation of the NMF model and clinical implications of ISDs for prognosis and therapeutic response prediction. CONCLUSIONS: NMF-MTLR provides many benefits over other models: superior model discrimination, superior calibration, meaningful survival time estimates, and accurate probabilistic estimates of survival over time for each individual patient. We advocate for the adoption of these cancer survival models in clinical and research settings.
format Online
Article
Text
id pubmed-10543961
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher American Association for Cancer Research
record_format MEDLINE/PubMed
spelling pubmed-105439612023-10-03 Learning Individual Survival Models from PanCancer Whole Transcriptome Data Kumar, Neeraj Skubleny, Daniel Parkes, Michael Verma, Ruchika Davis, Sacha Kumar, Luke Aissiou, Amira Greiner, Russell Clin Cancer Res Precision Medicine and Imaging PURPOSE: Personalized medicine attempts to predict survival time for each patient, based on their individual tumor molecular profile. We investigate whether our survival learner in combination with a dimension reduction method can produce useful survival estimates for a variety of patients with cancer. EXPERIMENTAL DESIGN: This article provides a method that learns a model for predicting the survival time for individual patients with cancer from the PanCancer Atlas: given the (16,335 dimensional) gene expression profiles from 10,173 patients, each having one of 33 cancers, this method uses unsupervised nonnegative matrix factorization (NMF) to reexpress the gene expression data for each patient in terms of 100 learned NMF factors. It then feeds these 100 factors into the Multi-Task Logistic Regression (MTLR) learner to produce cancer-specific models for each of 20 cancers (with >50 uncensored instances); this produces “individual survival distributions” (ISD), which provide survival probabilities at each future time for each individual patient, which provides a patient's risk score and estimated survival time. RESULTS: Our NMF-MTLR concordance indices outperformed the VAECox benchmark by 14.9% overall. We achieved optimal survival prediction using pan-cancer NMF in combination with cancer-specific MTLR models. We provide biological interpretation of the NMF model and clinical implications of ISDs for prognosis and therapeutic response prediction. CONCLUSIONS: NMF-MTLR provides many benefits over other models: superior model discrimination, superior calibration, meaningful survival time estimates, and accurate probabilistic estimates of survival over time for each individual patient. We advocate for the adoption of these cancer survival models in clinical and research settings. American Association for Cancer Research 2023-10-02 2023-07-18 /pmc/articles/PMC10543961/ /pubmed/37463063 http://dx.doi.org/10.1158/1078-0432.CCR-22-3493 Text en ©2023 The Authors; Published by the American Association for Cancer Research https://creativecommons.org/licenses/by-nc-nd/4.0/This open access article is distributed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) license.
spellingShingle Precision Medicine and Imaging
Kumar, Neeraj
Skubleny, Daniel
Parkes, Michael
Verma, Ruchika
Davis, Sacha
Kumar, Luke
Aissiou, Amira
Greiner, Russell
Learning Individual Survival Models from PanCancer Whole Transcriptome Data
title Learning Individual Survival Models from PanCancer Whole Transcriptome Data
title_full Learning Individual Survival Models from PanCancer Whole Transcriptome Data
title_fullStr Learning Individual Survival Models from PanCancer Whole Transcriptome Data
title_full_unstemmed Learning Individual Survival Models from PanCancer Whole Transcriptome Data
title_short Learning Individual Survival Models from PanCancer Whole Transcriptome Data
title_sort learning individual survival models from pancancer whole transcriptome data
topic Precision Medicine and Imaging
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10543961/
https://www.ncbi.nlm.nih.gov/pubmed/37463063
http://dx.doi.org/10.1158/1078-0432.CCR-22-3493
work_keys_str_mv AT kumarneeraj learningindividualsurvivalmodelsfrompancancerwholetranscriptomedata
AT skublenydaniel learningindividualsurvivalmodelsfrompancancerwholetranscriptomedata
AT parkesmichael learningindividualsurvivalmodelsfrompancancerwholetranscriptomedata
AT vermaruchika learningindividualsurvivalmodelsfrompancancerwholetranscriptomedata
AT davissacha learningindividualsurvivalmodelsfrompancancerwholetranscriptomedata
AT kumarluke learningindividualsurvivalmodelsfrompancancerwholetranscriptomedata
AT aissiouamira learningindividualsurvivalmodelsfrompancancerwholetranscriptomedata
AT greinerrussell learningindividualsurvivalmodelsfrompancancerwholetranscriptomedata