Cargando…

Performance Comparison of Deep Learning Autoencoders for Cancer Subtype Detection Using Multi-Omics Data

SIMPLE SUMMARY: Here, we compared the performance of four different autoencoders: (a) vanilla, (b) sparse, (c) denoising, and (d) variational for subtype detection on four cancer types: Glioblastoma multiforme, Colon Adenocarcinoma, Kidney renal clear cell carcinoma, and Breast invasive carcinoma. M...

Descripción completa

Detalles Bibliográficos
Autores principales:	Franco, Edian F., Rana, Pratip, Cruz, Aline, Calderón, Víctor V., Azevedo, Vasco, Ramos, Rommel T. J., Ghosh, Preetam
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8122584/ https://www.ncbi.nlm.nih.gov/pubmed/33921978 http://dx.doi.org/10.3390/cancers13092013

_version_	1783692657521852416
author	Franco, Edian F. Rana, Pratip Cruz, Aline Calderón, Víctor V. Azevedo, Vasco Ramos, Rommel T. J. Ghosh, Preetam
author_facet	Franco, Edian F. Rana, Pratip Cruz, Aline Calderón, Víctor V. Azevedo, Vasco Ramos, Rommel T. J. Ghosh, Preetam
author_sort	Franco, Edian F.
collection	PubMed
description	SIMPLE SUMMARY: Here, we compared the performance of four different autoencoders: (a) vanilla, (b) sparse, (c) denoising, and (d) variational for subtype detection on four cancer types: Glioblastoma multiforme, Colon Adenocarcinoma, Kidney renal clear cell carcinoma, and Breast invasive carcinoma. Multiview dataset comprising gene expression, DNA methylation, and miRNA expression from TCGA is fed into an autoencoder to get a compressed nonlinear representation. Then the clustering technique was applied on that compressed representation to reveal the subtype of cancer. Though different autoencoders’ performance varies on different datasets, they performed much better than standard data fusion techniques such as PCA, kernel PCA, and sparse PCA. ABSTRACT: A heterogeneous disease such as cancer is activated through multiple pathways and different perturbations. Depending upon the activated pathway(s), the survival of the patients varies significantly and shows different efficacy to various drugs. Therefore, cancer subtype detection using genomics level data is a significant research problem. Subtype detection is often a complex problem, and in most cases, needs multi-omics data fusion to achieve accurate subtyping. Different data fusion and subtyping approaches have been proposed over the years, such as kernel-based fusion, matrix factorization, and deep learning autoencoders. In this paper, we compared the performance of different deep learning autoencoders for cancer subtype detection. We performed cancer subtype detection on four different cancer types from The Cancer Genome Atlas (TCGA) datasets using four autoencoder implementations. We also predicted the optimal number of subtypes in a cancer type using the silhouette score and found that the detected subtypes exhibit significant differences in survival profiles. Furthermore, we compared the effect of feature selection and similarity measures for subtype detection. For further evaluation, we used the Glioblastoma multiforme (GBM) dataset and identified the differentially expressed genes in each of the subtypes. The results obtained are consistent with other genomic studies and can be corroborated with the involved pathways and biological functions. Thus, it shows that the results from the autoencoders, obtained through the interaction of different datatypes of cancer, can be used for the prediction and characterization of patient subgroups and survival profiles.
format	Online Article Text
id	pubmed-8122584
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-81225842021-05-16 Performance Comparison of Deep Learning Autoencoders for Cancer Subtype Detection Using Multi-Omics Data Franco, Edian F. Rana, Pratip Cruz, Aline Calderón, Víctor V. Azevedo, Vasco Ramos, Rommel T. J. Ghosh, Preetam Cancers (Basel) Article SIMPLE SUMMARY: Here, we compared the performance of four different autoencoders: (a) vanilla, (b) sparse, (c) denoising, and (d) variational for subtype detection on four cancer types: Glioblastoma multiforme, Colon Adenocarcinoma, Kidney renal clear cell carcinoma, and Breast invasive carcinoma. Multiview dataset comprising gene expression, DNA methylation, and miRNA expression from TCGA is fed into an autoencoder to get a compressed nonlinear representation. Then the clustering technique was applied on that compressed representation to reveal the subtype of cancer. Though different autoencoders’ performance varies on different datasets, they performed much better than standard data fusion techniques such as PCA, kernel PCA, and sparse PCA. ABSTRACT: A heterogeneous disease such as cancer is activated through multiple pathways and different perturbations. Depending upon the activated pathway(s), the survival of the patients varies significantly and shows different efficacy to various drugs. Therefore, cancer subtype detection using genomics level data is a significant research problem. Subtype detection is often a complex problem, and in most cases, needs multi-omics data fusion to achieve accurate subtyping. Different data fusion and subtyping approaches have been proposed over the years, such as kernel-based fusion, matrix factorization, and deep learning autoencoders. In this paper, we compared the performance of different deep learning autoencoders for cancer subtype detection. We performed cancer subtype detection on four different cancer types from The Cancer Genome Atlas (TCGA) datasets using four autoencoder implementations. We also predicted the optimal number of subtypes in a cancer type using the silhouette score and found that the detected subtypes exhibit significant differences in survival profiles. Furthermore, we compared the effect of feature selection and similarity measures for subtype detection. For further evaluation, we used the Glioblastoma multiforme (GBM) dataset and identified the differentially expressed genes in each of the subtypes. The results obtained are consistent with other genomic studies and can be corroborated with the involved pathways and biological functions. Thus, it shows that the results from the autoencoders, obtained through the interaction of different datatypes of cancer, can be used for the prediction and characterization of patient subgroups and survival profiles. MDPI 2021-04-22 /pmc/articles/PMC8122584/ /pubmed/33921978 http://dx.doi.org/10.3390/cancers13092013 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Franco, Edian F. Rana, Pratip Cruz, Aline Calderón, Víctor V. Azevedo, Vasco Ramos, Rommel T. J. Ghosh, Preetam Performance Comparison of Deep Learning Autoencoders for Cancer Subtype Detection Using Multi-Omics Data
title	Performance Comparison of Deep Learning Autoencoders for Cancer Subtype Detection Using Multi-Omics Data
title_full	Performance Comparison of Deep Learning Autoencoders for Cancer Subtype Detection Using Multi-Omics Data
title_fullStr	Performance Comparison of Deep Learning Autoencoders for Cancer Subtype Detection Using Multi-Omics Data
title_full_unstemmed	Performance Comparison of Deep Learning Autoencoders for Cancer Subtype Detection Using Multi-Omics Data
title_short	Performance Comparison of Deep Learning Autoencoders for Cancer Subtype Detection Using Multi-Omics Data
title_sort	performance comparison of deep learning autoencoders for cancer subtype detection using multi-omics data
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8122584/ https://www.ncbi.nlm.nih.gov/pubmed/33921978 http://dx.doi.org/10.3390/cancers13092013
work_keys_str_mv	AT francoedianf performancecomparisonofdeeplearningautoencodersforcancersubtypedetectionusingmultiomicsdata AT ranapratip performancecomparisonofdeeplearningautoencodersforcancersubtypedetectionusingmultiomicsdata AT cruzaline performancecomparisonofdeeplearningautoencodersforcancersubtypedetectionusingmultiomicsdata AT calderonvictorv performancecomparisonofdeeplearningautoencodersforcancersubtypedetectionusingmultiomicsdata AT azevedovasco performancecomparisonofdeeplearningautoencodersforcancersubtypedetectionusingmultiomicsdata AT ramosrommeltj performancecomparisonofdeeplearningautoencodersforcancersubtypedetectionusingmultiomicsdata AT ghoshpreetam performancecomparisonofdeeplearningautoencodersforcancersubtypedetectionusingmultiomicsdata

Performance Comparison of Deep Learning Autoencoders for Cancer Subtype Detection Using Multi-Omics Data

Ejemplares similares