Cargando…
Interpretable deep neural network for cancer survival analysis by integrating genomic and clinical data
BACKGROUND: Understanding the complex biological mechanisms of cancer patient survival using genomic and clinical data is vital, not only to develop new treatments for patients, but also to improve survival prediction. However, highly nonlinear and high-dimension, low-sample size (HDLSS) data cause...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6927105/ https://www.ncbi.nlm.nih.gov/pubmed/31865908 http://dx.doi.org/10.1186/s12920-019-0624-2 |
_version_ | 1783482240482672640 |
---|---|
author | Hao, Jie Kim, Youngsoon Mallavarapu, Tejaswini Oh, Jung Hun Kang, Mingon |
author_facet | Hao, Jie Kim, Youngsoon Mallavarapu, Tejaswini Oh, Jung Hun Kang, Mingon |
author_sort | Hao, Jie |
collection | PubMed |
description | BACKGROUND: Understanding the complex biological mechanisms of cancer patient survival using genomic and clinical data is vital, not only to develop new treatments for patients, but also to improve survival prediction. However, highly nonlinear and high-dimension, low-sample size (HDLSS) data cause computational challenges to applying conventional survival analysis. RESULTS: We propose a novel biologically interpretable pathway-based sparse deep neural network, named Cox-PASNet, which integrates high-dimensional gene expression data and clinical data on a simple neural network architecture for survival analysis. Cox-PASNet is biologically interpretable where nodes in the neural network correspond to biological genes and pathways, while capturing the nonlinear and hierarchical effects of biological pathways associated with cancer patient survival. We also propose a heuristic optimization solution to train Cox-PASNet with HDLSS data. Cox-PASNet was intensively evaluated by comparing the predictive performance of current state-of-the-art methods on glioblastoma multiforme (GBM) and ovarian serous cystadenocarcinoma (OV) cancer. In the experiments, Cox-PASNet showed out-performance, compared to the benchmarking methods. Moreover, the neural network architecture of Cox-PASNet was biologically interpreted, and several significant prognostic factors of genes and biological pathways were identified. CONCLUSIONS: Cox-PASNet models biological mechanisms in the neural network by incorporating biological pathway databases and sparse coding. The neural network of Cox-PASNet can identify nonlinear and hierarchical associations of genomic and clinical data to cancer patient survival. The open-source code of Cox-PASNet in PyTorch implemented for training, evaluation, and model interpretation is available at: https://github.com/DataX-JieHao/Cox-PASNet. |
format | Online Article Text |
id | pubmed-6927105 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-69271052019-12-30 Interpretable deep neural network for cancer survival analysis by integrating genomic and clinical data Hao, Jie Kim, Youngsoon Mallavarapu, Tejaswini Oh, Jung Hun Kang, Mingon BMC Med Genomics Research BACKGROUND: Understanding the complex biological mechanisms of cancer patient survival using genomic and clinical data is vital, not only to develop new treatments for patients, but also to improve survival prediction. However, highly nonlinear and high-dimension, low-sample size (HDLSS) data cause computational challenges to applying conventional survival analysis. RESULTS: We propose a novel biologically interpretable pathway-based sparse deep neural network, named Cox-PASNet, which integrates high-dimensional gene expression data and clinical data on a simple neural network architecture for survival analysis. Cox-PASNet is biologically interpretable where nodes in the neural network correspond to biological genes and pathways, while capturing the nonlinear and hierarchical effects of biological pathways associated with cancer patient survival. We also propose a heuristic optimization solution to train Cox-PASNet with HDLSS data. Cox-PASNet was intensively evaluated by comparing the predictive performance of current state-of-the-art methods on glioblastoma multiforme (GBM) and ovarian serous cystadenocarcinoma (OV) cancer. In the experiments, Cox-PASNet showed out-performance, compared to the benchmarking methods. Moreover, the neural network architecture of Cox-PASNet was biologically interpreted, and several significant prognostic factors of genes and biological pathways were identified. CONCLUSIONS: Cox-PASNet models biological mechanisms in the neural network by incorporating biological pathway databases and sparse coding. The neural network of Cox-PASNet can identify nonlinear and hierarchical associations of genomic and clinical data to cancer patient survival. The open-source code of Cox-PASNet in PyTorch implemented for training, evaluation, and model interpretation is available at: https://github.com/DataX-JieHao/Cox-PASNet. BioMed Central 2019-12-23 /pmc/articles/PMC6927105/ /pubmed/31865908 http://dx.doi.org/10.1186/s12920-019-0624-2 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Hao, Jie Kim, Youngsoon Mallavarapu, Tejaswini Oh, Jung Hun Kang, Mingon Interpretable deep neural network for cancer survival analysis by integrating genomic and clinical data |
title | Interpretable deep neural network for cancer survival analysis by integrating genomic and clinical data |
title_full | Interpretable deep neural network for cancer survival analysis by integrating genomic and clinical data |
title_fullStr | Interpretable deep neural network for cancer survival analysis by integrating genomic and clinical data |
title_full_unstemmed | Interpretable deep neural network for cancer survival analysis by integrating genomic and clinical data |
title_short | Interpretable deep neural network for cancer survival analysis by integrating genomic and clinical data |
title_sort | interpretable deep neural network for cancer survival analysis by integrating genomic and clinical data |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6927105/ https://www.ncbi.nlm.nih.gov/pubmed/31865908 http://dx.doi.org/10.1186/s12920-019-0624-2 |
work_keys_str_mv | AT haojie interpretabledeepneuralnetworkforcancersurvivalanalysisbyintegratinggenomicandclinicaldata AT kimyoungsoon interpretabledeepneuralnetworkforcancersurvivalanalysisbyintegratinggenomicandclinicaldata AT mallavaraputejaswini interpretabledeepneuralnetworkforcancersurvivalanalysisbyintegratinggenomicandclinicaldata AT ohjunghun interpretabledeepneuralnetworkforcancersurvivalanalysisbyintegratinggenomicandclinicaldata AT kangmingon interpretabledeepneuralnetworkforcancersurvivalanalysisbyintegratinggenomicandclinicaldata |