Cargando…

Interpretable deep neural network for cancer survival analysis by integrating genomic and clinical data

BACKGROUND: Understanding the complex biological mechanisms of cancer patient survival using genomic and clinical data is vital, not only to develop new treatments for patients, but also to improve survival prediction. However, highly nonlinear and high-dimension, low-sample size (HDLSS) data cause...

Descripción completa

Detalles Bibliográficos
Autores principales: Hao, Jie, Kim, Youngsoon, Mallavarapu, Tejaswini, Oh, Jung Hun, Kang, Mingon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6927105/
https://www.ncbi.nlm.nih.gov/pubmed/31865908
http://dx.doi.org/10.1186/s12920-019-0624-2
_version_ 1783482240482672640
author Hao, Jie
Kim, Youngsoon
Mallavarapu, Tejaswini
Oh, Jung Hun
Kang, Mingon
author_facet Hao, Jie
Kim, Youngsoon
Mallavarapu, Tejaswini
Oh, Jung Hun
Kang, Mingon
author_sort Hao, Jie
collection PubMed
description BACKGROUND: Understanding the complex biological mechanisms of cancer patient survival using genomic and clinical data is vital, not only to develop new treatments for patients, but also to improve survival prediction. However, highly nonlinear and high-dimension, low-sample size (HDLSS) data cause computational challenges to applying conventional survival analysis. RESULTS: We propose a novel biologically interpretable pathway-based sparse deep neural network, named Cox-PASNet, which integrates high-dimensional gene expression data and clinical data on a simple neural network architecture for survival analysis. Cox-PASNet is biologically interpretable where nodes in the neural network correspond to biological genes and pathways, while capturing the nonlinear and hierarchical effects of biological pathways associated with cancer patient survival. We also propose a heuristic optimization solution to train Cox-PASNet with HDLSS data. Cox-PASNet was intensively evaluated by comparing the predictive performance of current state-of-the-art methods on glioblastoma multiforme (GBM) and ovarian serous cystadenocarcinoma (OV) cancer. In the experiments, Cox-PASNet showed out-performance, compared to the benchmarking methods. Moreover, the neural network architecture of Cox-PASNet was biologically interpreted, and several significant prognostic factors of genes and biological pathways were identified. CONCLUSIONS: Cox-PASNet models biological mechanisms in the neural network by incorporating biological pathway databases and sparse coding. The neural network of Cox-PASNet can identify nonlinear and hierarchical associations of genomic and clinical data to cancer patient survival. The open-source code of Cox-PASNet in PyTorch implemented for training, evaluation, and model interpretation is available at: https://github.com/DataX-JieHao/Cox-PASNet.
format Online
Article
Text
id pubmed-6927105
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-69271052019-12-30 Interpretable deep neural network for cancer survival analysis by integrating genomic and clinical data Hao, Jie Kim, Youngsoon Mallavarapu, Tejaswini Oh, Jung Hun Kang, Mingon BMC Med Genomics Research BACKGROUND: Understanding the complex biological mechanisms of cancer patient survival using genomic and clinical data is vital, not only to develop new treatments for patients, but also to improve survival prediction. However, highly nonlinear and high-dimension, low-sample size (HDLSS) data cause computational challenges to applying conventional survival analysis. RESULTS: We propose a novel biologically interpretable pathway-based sparse deep neural network, named Cox-PASNet, which integrates high-dimensional gene expression data and clinical data on a simple neural network architecture for survival analysis. Cox-PASNet is biologically interpretable where nodes in the neural network correspond to biological genes and pathways, while capturing the nonlinear and hierarchical effects of biological pathways associated with cancer patient survival. We also propose a heuristic optimization solution to train Cox-PASNet with HDLSS data. Cox-PASNet was intensively evaluated by comparing the predictive performance of current state-of-the-art methods on glioblastoma multiforme (GBM) and ovarian serous cystadenocarcinoma (OV) cancer. In the experiments, Cox-PASNet showed out-performance, compared to the benchmarking methods. Moreover, the neural network architecture of Cox-PASNet was biologically interpreted, and several significant prognostic factors of genes and biological pathways were identified. CONCLUSIONS: Cox-PASNet models biological mechanisms in the neural network by incorporating biological pathway databases and sparse coding. The neural network of Cox-PASNet can identify nonlinear and hierarchical associations of genomic and clinical data to cancer patient survival. The open-source code of Cox-PASNet in PyTorch implemented for training, evaluation, and model interpretation is available at: https://github.com/DataX-JieHao/Cox-PASNet. BioMed Central 2019-12-23 /pmc/articles/PMC6927105/ /pubmed/31865908 http://dx.doi.org/10.1186/s12920-019-0624-2 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Hao, Jie
Kim, Youngsoon
Mallavarapu, Tejaswini
Oh, Jung Hun
Kang, Mingon
Interpretable deep neural network for cancer survival analysis by integrating genomic and clinical data
title Interpretable deep neural network for cancer survival analysis by integrating genomic and clinical data
title_full Interpretable deep neural network for cancer survival analysis by integrating genomic and clinical data
title_fullStr Interpretable deep neural network for cancer survival analysis by integrating genomic and clinical data
title_full_unstemmed Interpretable deep neural network for cancer survival analysis by integrating genomic and clinical data
title_short Interpretable deep neural network for cancer survival analysis by integrating genomic and clinical data
title_sort interpretable deep neural network for cancer survival analysis by integrating genomic and clinical data
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6927105/
https://www.ncbi.nlm.nih.gov/pubmed/31865908
http://dx.doi.org/10.1186/s12920-019-0624-2
work_keys_str_mv AT haojie interpretabledeepneuralnetworkforcancersurvivalanalysisbyintegratinggenomicandclinicaldata
AT kimyoungsoon interpretabledeepneuralnetworkforcancersurvivalanalysisbyintegratinggenomicandclinicaldata
AT mallavaraputejaswini interpretabledeepneuralnetworkforcancersurvivalanalysisbyintegratinggenomicandclinicaldata
AT ohjunghun interpretabledeepneuralnetworkforcancersurvivalanalysisbyintegratinggenomicandclinicaldata
AT kangmingon interpretabledeepneuralnetworkforcancersurvivalanalysisbyintegratinggenomicandclinicaldata