Cargando…
Survival prediction in patients with colon adenocarcinoma via multiomics data integration using a deep learning algorithm
The present study proposed a deep learning (DL) algorithm to predict survival in patients with colon adenocarcinoma (COAD) based on multiomics integration. The survival-sensitive model was constructed using an autoencoder for DL implementation based on The Cancer Genome Atlas (TCGA) data of patients...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Portland Press Ltd.
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7753845/ https://www.ncbi.nlm.nih.gov/pubmed/33258470 http://dx.doi.org/10.1042/BSR20201482 |
_version_ | 1783626075049295872 |
---|---|
author | Lv, Jiudi Wang, Junjie Shang, Xiujuan Liu, Fangfang Guo, Shixun |
author_facet | Lv, Jiudi Wang, Junjie Shang, Xiujuan Liu, Fangfang Guo, Shixun |
author_sort | Lv, Jiudi |
collection | PubMed |
description | The present study proposed a deep learning (DL) algorithm to predict survival in patients with colon adenocarcinoma (COAD) based on multiomics integration. The survival-sensitive model was constructed using an autoencoder for DL implementation based on The Cancer Genome Atlas (TCGA) data of patients with COAD. The autoencoder framework was compared with PCA, NMF, t-SNE, and univariable Cox-PH model for identifying survival-related features. The prognostic robustness of the inferred survival risk groups was validated using three independent confirmation cohorts. Differential expression analysis, Pearson’s correlation analysis, construction of miRNA–target gene network, and function enrichment analysis were performed. Two risk groups with significant survival differences were identified in TCGA set using the autoencoder-based model (log-rank P-value = 5.51e(−07)). The autoencoder framework showed superior performance compared with PCA, NMF, t-SNE, and the univariable Cox-PH model based on the C-index, log-rank P-value, and Brier score. The robustness of the classification model was successfully verified in three independent validation sets. There were 1271 differentially expressed genes, 10 differentially expressed miRNAs, and 12 hypermethylated genes between the survival risk groups. Among these, miR-133b and its target genes (GNB4, PTPRZ1, RUNX1T1, EPHA7, GPM6A, BICC1, and ADAMTS5) were used to construct a network. These genes were significantly enriched in ECM–receptor interaction, focal adhesion, PI3K–Akt signaling pathway, and glucose metabolism-related pathways. The risk subgroups obtained through a multiomics data integration pipeline using the DL algorithm had good robustness. miR-133b and its target genes could be potential diagnostic markers. The results would assist in elucidating the possible pathogenesis of COAD. |
format | Online Article Text |
id | pubmed-7753845 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Portland Press Ltd. |
record_format | MEDLINE/PubMed |
spelling | pubmed-77538452021-01-05 Survival prediction in patients with colon adenocarcinoma via multiomics data integration using a deep learning algorithm Lv, Jiudi Wang, Junjie Shang, Xiujuan Liu, Fangfang Guo, Shixun Biosci Rep Bioinformatics The present study proposed a deep learning (DL) algorithm to predict survival in patients with colon adenocarcinoma (COAD) based on multiomics integration. The survival-sensitive model was constructed using an autoencoder for DL implementation based on The Cancer Genome Atlas (TCGA) data of patients with COAD. The autoencoder framework was compared with PCA, NMF, t-SNE, and univariable Cox-PH model for identifying survival-related features. The prognostic robustness of the inferred survival risk groups was validated using three independent confirmation cohorts. Differential expression analysis, Pearson’s correlation analysis, construction of miRNA–target gene network, and function enrichment analysis were performed. Two risk groups with significant survival differences were identified in TCGA set using the autoencoder-based model (log-rank P-value = 5.51e(−07)). The autoencoder framework showed superior performance compared with PCA, NMF, t-SNE, and the univariable Cox-PH model based on the C-index, log-rank P-value, and Brier score. The robustness of the classification model was successfully verified in three independent validation sets. There were 1271 differentially expressed genes, 10 differentially expressed miRNAs, and 12 hypermethylated genes between the survival risk groups. Among these, miR-133b and its target genes (GNB4, PTPRZ1, RUNX1T1, EPHA7, GPM6A, BICC1, and ADAMTS5) were used to construct a network. These genes were significantly enriched in ECM–receptor interaction, focal adhesion, PI3K–Akt signaling pathway, and glucose metabolism-related pathways. The risk subgroups obtained through a multiomics data integration pipeline using the DL algorithm had good robustness. miR-133b and its target genes could be potential diagnostic markers. The results would assist in elucidating the possible pathogenesis of COAD. Portland Press Ltd. 2020-12-21 /pmc/articles/PMC7753845/ /pubmed/33258470 http://dx.doi.org/10.1042/BSR20201482 Text en © 2020 The Author(s). https://creativecommons.org/licenses/by/4.0/ This is an open access article published by Portland Press Limited on behalf of the Biochemical Society and distributed under the . |
spellingShingle | Bioinformatics Lv, Jiudi Wang, Junjie Shang, Xiujuan Liu, Fangfang Guo, Shixun Survival prediction in patients with colon adenocarcinoma via multiomics data integration using a deep learning algorithm |
title | Survival prediction in patients with colon adenocarcinoma via multiomics data integration using a deep learning algorithm |
title_full | Survival prediction in patients with colon adenocarcinoma via multiomics data integration using a deep learning algorithm |
title_fullStr | Survival prediction in patients with colon adenocarcinoma via multiomics data integration using a deep learning algorithm |
title_full_unstemmed | Survival prediction in patients with colon adenocarcinoma via multiomics data integration using a deep learning algorithm |
title_short | Survival prediction in patients with colon adenocarcinoma via multiomics data integration using a deep learning algorithm |
title_sort | survival prediction in patients with colon adenocarcinoma via multiomics data integration using a deep learning algorithm |
topic | Bioinformatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7753845/ https://www.ncbi.nlm.nih.gov/pubmed/33258470 http://dx.doi.org/10.1042/BSR20201482 |
work_keys_str_mv | AT lvjiudi survivalpredictioninpatientswithcolonadenocarcinomaviamultiomicsdataintegrationusingadeeplearningalgorithm AT wangjunjie survivalpredictioninpatientswithcolonadenocarcinomaviamultiomicsdataintegrationusingadeeplearningalgorithm AT shangxiujuan survivalpredictioninpatientswithcolonadenocarcinomaviamultiomicsdataintegrationusingadeeplearningalgorithm AT liufangfang survivalpredictioninpatientswithcolonadenocarcinomaviamultiomicsdataintegrationusingadeeplearningalgorithm AT guoshixun survivalpredictioninpatientswithcolonadenocarcinomaviamultiomicsdataintegrationusingadeeplearningalgorithm |