Cargando…
Learning Individual Survival Models from PanCancer Whole Transcriptome Data
PURPOSE: Personalized medicine attempts to predict survival time for each patient, based on their individual tumor molecular profile. We investigate whether our survival learner in combination with a dimension reduction method can produce useful survival estimates for a variety of patients with canc...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Association for Cancer Research
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10543961/ https://www.ncbi.nlm.nih.gov/pubmed/37463063 http://dx.doi.org/10.1158/1078-0432.CCR-22-3493 |
_version_ | 1785114398061232128 |
---|---|
author | Kumar, Neeraj Skubleny, Daniel Parkes, Michael Verma, Ruchika Davis, Sacha Kumar, Luke Aissiou, Amira Greiner, Russell |
author_facet | Kumar, Neeraj Skubleny, Daniel Parkes, Michael Verma, Ruchika Davis, Sacha Kumar, Luke Aissiou, Amira Greiner, Russell |
author_sort | Kumar, Neeraj |
collection | PubMed |
description | PURPOSE: Personalized medicine attempts to predict survival time for each patient, based on their individual tumor molecular profile. We investigate whether our survival learner in combination with a dimension reduction method can produce useful survival estimates for a variety of patients with cancer. EXPERIMENTAL DESIGN: This article provides a method that learns a model for predicting the survival time for individual patients with cancer from the PanCancer Atlas: given the (16,335 dimensional) gene expression profiles from 10,173 patients, each having one of 33 cancers, this method uses unsupervised nonnegative matrix factorization (NMF) to reexpress the gene expression data for each patient in terms of 100 learned NMF factors. It then feeds these 100 factors into the Multi-Task Logistic Regression (MTLR) learner to produce cancer-specific models for each of 20 cancers (with >50 uncensored instances); this produces “individual survival distributions” (ISD), which provide survival probabilities at each future time for each individual patient, which provides a patient's risk score and estimated survival time. RESULTS: Our NMF-MTLR concordance indices outperformed the VAECox benchmark by 14.9% overall. We achieved optimal survival prediction using pan-cancer NMF in combination with cancer-specific MTLR models. We provide biological interpretation of the NMF model and clinical implications of ISDs for prognosis and therapeutic response prediction. CONCLUSIONS: NMF-MTLR provides many benefits over other models: superior model discrimination, superior calibration, meaningful survival time estimates, and accurate probabilistic estimates of survival over time for each individual patient. We advocate for the adoption of these cancer survival models in clinical and research settings. |
format | Online Article Text |
id | pubmed-10543961 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | American Association for Cancer Research |
record_format | MEDLINE/PubMed |
spelling | pubmed-105439612023-10-03 Learning Individual Survival Models from PanCancer Whole Transcriptome Data Kumar, Neeraj Skubleny, Daniel Parkes, Michael Verma, Ruchika Davis, Sacha Kumar, Luke Aissiou, Amira Greiner, Russell Clin Cancer Res Precision Medicine and Imaging PURPOSE: Personalized medicine attempts to predict survival time for each patient, based on their individual tumor molecular profile. We investigate whether our survival learner in combination with a dimension reduction method can produce useful survival estimates for a variety of patients with cancer. EXPERIMENTAL DESIGN: This article provides a method that learns a model for predicting the survival time for individual patients with cancer from the PanCancer Atlas: given the (16,335 dimensional) gene expression profiles from 10,173 patients, each having one of 33 cancers, this method uses unsupervised nonnegative matrix factorization (NMF) to reexpress the gene expression data for each patient in terms of 100 learned NMF factors. It then feeds these 100 factors into the Multi-Task Logistic Regression (MTLR) learner to produce cancer-specific models for each of 20 cancers (with >50 uncensored instances); this produces “individual survival distributions” (ISD), which provide survival probabilities at each future time for each individual patient, which provides a patient's risk score and estimated survival time. RESULTS: Our NMF-MTLR concordance indices outperformed the VAECox benchmark by 14.9% overall. We achieved optimal survival prediction using pan-cancer NMF in combination with cancer-specific MTLR models. We provide biological interpretation of the NMF model and clinical implications of ISDs for prognosis and therapeutic response prediction. CONCLUSIONS: NMF-MTLR provides many benefits over other models: superior model discrimination, superior calibration, meaningful survival time estimates, and accurate probabilistic estimates of survival over time for each individual patient. We advocate for the adoption of these cancer survival models in clinical and research settings. American Association for Cancer Research 2023-10-02 2023-07-18 /pmc/articles/PMC10543961/ /pubmed/37463063 http://dx.doi.org/10.1158/1078-0432.CCR-22-3493 Text en ©2023 The Authors; Published by the American Association for Cancer Research https://creativecommons.org/licenses/by-nc-nd/4.0/This open access article is distributed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) license. |
spellingShingle | Precision Medicine and Imaging Kumar, Neeraj Skubleny, Daniel Parkes, Michael Verma, Ruchika Davis, Sacha Kumar, Luke Aissiou, Amira Greiner, Russell Learning Individual Survival Models from PanCancer Whole Transcriptome Data |
title | Learning Individual Survival Models from PanCancer Whole Transcriptome Data |
title_full | Learning Individual Survival Models from PanCancer Whole Transcriptome Data |
title_fullStr | Learning Individual Survival Models from PanCancer Whole Transcriptome Data |
title_full_unstemmed | Learning Individual Survival Models from PanCancer Whole Transcriptome Data |
title_short | Learning Individual Survival Models from PanCancer Whole Transcriptome Data |
title_sort | learning individual survival models from pancancer whole transcriptome data |
topic | Precision Medicine and Imaging |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10543961/ https://www.ncbi.nlm.nih.gov/pubmed/37463063 http://dx.doi.org/10.1158/1078-0432.CCR-22-3493 |
work_keys_str_mv | AT kumarneeraj learningindividualsurvivalmodelsfrompancancerwholetranscriptomedata AT skublenydaniel learningindividualsurvivalmodelsfrompancancerwholetranscriptomedata AT parkesmichael learningindividualsurvivalmodelsfrompancancerwholetranscriptomedata AT vermaruchika learningindividualsurvivalmodelsfrompancancerwholetranscriptomedata AT davissacha learningindividualsurvivalmodelsfrompancancerwholetranscriptomedata AT kumarluke learningindividualsurvivalmodelsfrompancancerwholetranscriptomedata AT aissiouamira learningindividualsurvivalmodelsfrompancancerwholetranscriptomedata AT greinerrussell learningindividualsurvivalmodelsfrompancancerwholetranscriptomedata |