Cargando…

Uncovering the prognostic gene signatures for the improvement of risk stratification in cancers by using deep learning algorithm coupled with wavelet transform

BACKGROUND: The aim of gene expression-based clinical modelling in tumorigenesis is not only to accurately predict the clinical endpoints, but also to reveal the genome characteristics for downstream analysis for the purpose of understanding the mechanisms of cancers. Most of the conventional machin...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Yiru, Zhou, Yifan, Liu, Yuan, Hao, Yinyi, Li, Menglong, Pu, Xuemei, Li, Chuan, Wen, Zhining
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7236453/
https://www.ncbi.nlm.nih.gov/pubmed/32429941
http://dx.doi.org/10.1186/s12859-020-03544-z
_version_ 1783536158449336320
author Zhao, Yiru
Zhou, Yifan
Liu, Yuan
Hao, Yinyi
Li, Menglong
Pu, Xuemei
Li, Chuan
Wen, Zhining
author_facet Zhao, Yiru
Zhou, Yifan
Liu, Yuan
Hao, Yinyi
Li, Menglong
Pu, Xuemei
Li, Chuan
Wen, Zhining
author_sort Zhao, Yiru
collection PubMed
description BACKGROUND: The aim of gene expression-based clinical modelling in tumorigenesis is not only to accurately predict the clinical endpoints, but also to reveal the genome characteristics for downstream analysis for the purpose of understanding the mechanisms of cancers. Most of the conventional machine learning methods involved a gene filtering step, in which tens of thousands of genes were firstly filtered based on the gene expression levels by a statistical method with an arbitrary cutoff. Although gene filtering procedure helps to reduce the feature dimension and avoid overfitting, there is a risk that some pathogenic genes important to the disease will be ignored. RESULTS: In this study, we proposed a novel deep learning approach by combining a convolutional neural network with stationary wavelet transform (SWT-CNN) for stratifying cancer patients and predicting their clinical outcomes without gene filtering based on tumor genomic profiles. The proposed SWT-CNN overperformed the state-of-art algorithms, including support vector machine (SVM) and logistic regression (LR), and produced comparable prediction performance to random forest (RF). Furthermore, for all the cancer types, we firstly proposed a method to weight the genes with the scores, which took advantage of the representative features in the hidden layer of convolutional neural network, and then selected the prognostic genes for the Cox proportional-hazards regression. The results showed that risk stratifications can be effectively improved by using the identified prognostic genes as feature, indicating that the representative features generated by SWT-CNN can well correlate the genes with prognostic risk in cancers and be helpful for selecting the prognostic gene signatures. CONCLUSIONS: Our results indicated that gene expression-based SWT-CNN model can be an excellent tool for stratifying the prognostic risk for cancer patients. In addition, the representative features of SWT-CNN were validated to be useful for evaluating the importance of the genes in the risk stratification and can be further used to identify the prognostic gene signatures.
format Online
Article
Text
id pubmed-7236453
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-72364532020-05-29 Uncovering the prognostic gene signatures for the improvement of risk stratification in cancers by using deep learning algorithm coupled with wavelet transform Zhao, Yiru Zhou, Yifan Liu, Yuan Hao, Yinyi Li, Menglong Pu, Xuemei Li, Chuan Wen, Zhining BMC Bioinformatics Methodology Article BACKGROUND: The aim of gene expression-based clinical modelling in tumorigenesis is not only to accurately predict the clinical endpoints, but also to reveal the genome characteristics for downstream analysis for the purpose of understanding the mechanisms of cancers. Most of the conventional machine learning methods involved a gene filtering step, in which tens of thousands of genes were firstly filtered based on the gene expression levels by a statistical method with an arbitrary cutoff. Although gene filtering procedure helps to reduce the feature dimension and avoid overfitting, there is a risk that some pathogenic genes important to the disease will be ignored. RESULTS: In this study, we proposed a novel deep learning approach by combining a convolutional neural network with stationary wavelet transform (SWT-CNN) for stratifying cancer patients and predicting their clinical outcomes without gene filtering based on tumor genomic profiles. The proposed SWT-CNN overperformed the state-of-art algorithms, including support vector machine (SVM) and logistic regression (LR), and produced comparable prediction performance to random forest (RF). Furthermore, for all the cancer types, we firstly proposed a method to weight the genes with the scores, which took advantage of the representative features in the hidden layer of convolutional neural network, and then selected the prognostic genes for the Cox proportional-hazards regression. The results showed that risk stratifications can be effectively improved by using the identified prognostic genes as feature, indicating that the representative features generated by SWT-CNN can well correlate the genes with prognostic risk in cancers and be helpful for selecting the prognostic gene signatures. CONCLUSIONS: Our results indicated that gene expression-based SWT-CNN model can be an excellent tool for stratifying the prognostic risk for cancer patients. In addition, the representative features of SWT-CNN were validated to be useful for evaluating the importance of the genes in the risk stratification and can be further used to identify the prognostic gene signatures. BioMed Central 2020-05-19 /pmc/articles/PMC7236453/ /pubmed/32429941 http://dx.doi.org/10.1186/s12859-020-03544-z Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology Article
Zhao, Yiru
Zhou, Yifan
Liu, Yuan
Hao, Yinyi
Li, Menglong
Pu, Xuemei
Li, Chuan
Wen, Zhining
Uncovering the prognostic gene signatures for the improvement of risk stratification in cancers by using deep learning algorithm coupled with wavelet transform
title Uncovering the prognostic gene signatures for the improvement of risk stratification in cancers by using deep learning algorithm coupled with wavelet transform
title_full Uncovering the prognostic gene signatures for the improvement of risk stratification in cancers by using deep learning algorithm coupled with wavelet transform
title_fullStr Uncovering the prognostic gene signatures for the improvement of risk stratification in cancers by using deep learning algorithm coupled with wavelet transform
title_full_unstemmed Uncovering the prognostic gene signatures for the improvement of risk stratification in cancers by using deep learning algorithm coupled with wavelet transform
title_short Uncovering the prognostic gene signatures for the improvement of risk stratification in cancers by using deep learning algorithm coupled with wavelet transform
title_sort uncovering the prognostic gene signatures for the improvement of risk stratification in cancers by using deep learning algorithm coupled with wavelet transform
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7236453/
https://www.ncbi.nlm.nih.gov/pubmed/32429941
http://dx.doi.org/10.1186/s12859-020-03544-z
work_keys_str_mv AT zhaoyiru uncoveringtheprognosticgenesignaturesfortheimprovementofriskstratificationincancersbyusingdeeplearningalgorithmcoupledwithwavelettransform
AT zhouyifan uncoveringtheprognosticgenesignaturesfortheimprovementofriskstratificationincancersbyusingdeeplearningalgorithmcoupledwithwavelettransform
AT liuyuan uncoveringtheprognosticgenesignaturesfortheimprovementofriskstratificationincancersbyusingdeeplearningalgorithmcoupledwithwavelettransform
AT haoyinyi uncoveringtheprognosticgenesignaturesfortheimprovementofriskstratificationincancersbyusingdeeplearningalgorithmcoupledwithwavelettransform
AT limenglong uncoveringtheprognosticgenesignaturesfortheimprovementofriskstratificationincancersbyusingdeeplearningalgorithmcoupledwithwavelettransform
AT puxuemei uncoveringtheprognosticgenesignaturesfortheimprovementofriskstratificationincancersbyusingdeeplearningalgorithmcoupledwithwavelettransform
AT lichuan uncoveringtheprognosticgenesignaturesfortheimprovementofriskstratificationincancersbyusingdeeplearningalgorithmcoupledwithwavelettransform
AT wenzhining uncoveringtheprognosticgenesignaturesfortheimprovementofriskstratificationincancersbyusingdeeplearningalgorithmcoupledwithwavelettransform