Cargando…

Denoising Autoencoder, A Deep Learning Algorithm, Aids the Identification of A Novel Molecular Signature of Lung Adenocarcinoma

Precise biomarker development is a key step in disease management. However, most of the published biomarkers were derived from a relatively small number of samples with supervised approaches. Recent advances in unsupervised machine learning promise to leverage very large datasets for making better p...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Jun, Xie, Xueying, Shi, Junchao, He, Wenjun, Chen, Qi, Chen, Liang, Gu, Wanjun, Zhou, Tong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8242334/
https://www.ncbi.nlm.nih.gov/pubmed/33346087
http://dx.doi.org/10.1016/j.gpb.2019.02.003
_version_ 1783715612530311168
author Wang, Jun
Xie, Xueying
Shi, Junchao
He, Wenjun
Chen, Qi
Chen, Liang
Gu, Wanjun
Zhou, Tong
author_facet Wang, Jun
Xie, Xueying
Shi, Junchao
He, Wenjun
Chen, Qi
Chen, Liang
Gu, Wanjun
Zhou, Tong
author_sort Wang, Jun
collection PubMed
description Precise biomarker development is a key step in disease management. However, most of the published biomarkers were derived from a relatively small number of samples with supervised approaches. Recent advances in unsupervised machine learning promise to leverage very large datasets for making better predictions of disease biomarkers. Denoising autoencoder (DA) is one of the unsupervised deep learning algorithms, which is a stochastic version of autoencoder techniques. The principle of DA is to force the hidden layer of autoencoder to capture more robust features by reconstructing a clean input from a corrupted one. Here, a DA model was applied to analyze integrated transcriptomic data from 13 published lung cancer studies, which consisted of 1916 human lung tissue samples. Using DA, we discovered a molecular signature composed of multiple genes for lung adenocarcinoma (ADC). In independent validation cohorts, the proposed molecular signature is proved to be an effective classifier for lung cancer histological subtypes. Also, this signature successfully predicts clinical outcome in lung ADC, which is independent of traditional prognostic factors. More importantly, this signature exhibits a superior prognostic power compared with the other published prognostic genes. Our study suggests that unsupervised learning is helpful for biomarker development in the era of precision medicine.
format Online
Article
Text
id pubmed-8242334
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-82423342021-07-02 Denoising Autoencoder, A Deep Learning Algorithm, Aids the Identification of A Novel Molecular Signature of Lung Adenocarcinoma Wang, Jun Xie, Xueying Shi, Junchao He, Wenjun Chen, Qi Chen, Liang Gu, Wanjun Zhou, Tong Genomics Proteomics Bioinformatics Original Research Precise biomarker development is a key step in disease management. However, most of the published biomarkers were derived from a relatively small number of samples with supervised approaches. Recent advances in unsupervised machine learning promise to leverage very large datasets for making better predictions of disease biomarkers. Denoising autoencoder (DA) is one of the unsupervised deep learning algorithms, which is a stochastic version of autoencoder techniques. The principle of DA is to force the hidden layer of autoencoder to capture more robust features by reconstructing a clean input from a corrupted one. Here, a DA model was applied to analyze integrated transcriptomic data from 13 published lung cancer studies, which consisted of 1916 human lung tissue samples. Using DA, we discovered a molecular signature composed of multiple genes for lung adenocarcinoma (ADC). In independent validation cohorts, the proposed molecular signature is proved to be an effective classifier for lung cancer histological subtypes. Also, this signature successfully predicts clinical outcome in lung ADC, which is independent of traditional prognostic factors. More importantly, this signature exhibits a superior prognostic power compared with the other published prognostic genes. Our study suggests that unsupervised learning is helpful for biomarker development in the era of precision medicine. Elsevier 2020-08 2020-12-18 /pmc/articles/PMC8242334/ /pubmed/33346087 http://dx.doi.org/10.1016/j.gpb.2019.02.003 Text en © 2020 The Authors https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Original Research
Wang, Jun
Xie, Xueying
Shi, Junchao
He, Wenjun
Chen, Qi
Chen, Liang
Gu, Wanjun
Zhou, Tong
Denoising Autoencoder, A Deep Learning Algorithm, Aids the Identification of A Novel Molecular Signature of Lung Adenocarcinoma
title Denoising Autoencoder, A Deep Learning Algorithm, Aids the Identification of A Novel Molecular Signature of Lung Adenocarcinoma
title_full Denoising Autoencoder, A Deep Learning Algorithm, Aids the Identification of A Novel Molecular Signature of Lung Adenocarcinoma
title_fullStr Denoising Autoencoder, A Deep Learning Algorithm, Aids the Identification of A Novel Molecular Signature of Lung Adenocarcinoma
title_full_unstemmed Denoising Autoencoder, A Deep Learning Algorithm, Aids the Identification of A Novel Molecular Signature of Lung Adenocarcinoma
title_short Denoising Autoencoder, A Deep Learning Algorithm, Aids the Identification of A Novel Molecular Signature of Lung Adenocarcinoma
title_sort denoising autoencoder, a deep learning algorithm, aids the identification of a novel molecular signature of lung adenocarcinoma
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8242334/
https://www.ncbi.nlm.nih.gov/pubmed/33346087
http://dx.doi.org/10.1016/j.gpb.2019.02.003
work_keys_str_mv AT wangjun denoisingautoencoderadeeplearningalgorithmaidstheidentificationofanovelmolecularsignatureoflungadenocarcinoma
AT xiexueying denoisingautoencoderadeeplearningalgorithmaidstheidentificationofanovelmolecularsignatureoflungadenocarcinoma
AT shijunchao denoisingautoencoderadeeplearningalgorithmaidstheidentificationofanovelmolecularsignatureoflungadenocarcinoma
AT hewenjun denoisingautoencoderadeeplearningalgorithmaidstheidentificationofanovelmolecularsignatureoflungadenocarcinoma
AT chenqi denoisingautoencoderadeeplearningalgorithmaidstheidentificationofanovelmolecularsignatureoflungadenocarcinoma
AT chenliang denoisingautoencoderadeeplearningalgorithmaidstheidentificationofanovelmolecularsignatureoflungadenocarcinoma
AT guwanjun denoisingautoencoderadeeplearningalgorithmaidstheidentificationofanovelmolecularsignatureoflungadenocarcinoma
AT zhoutong denoisingautoencoderadeeplearningalgorithmaidstheidentificationofanovelmolecularsignatureoflungadenocarcinoma