Cargando…
Uncovering the prognostic gene signatures for the improvement of risk stratification in cancers by using deep learning algorithm coupled with wavelet transform
BACKGROUND: The aim of gene expression-based clinical modelling in tumorigenesis is not only to accurately predict the clinical endpoints, but also to reveal the genome characteristics for downstream analysis for the purpose of understanding the mechanisms of cancers. Most of the conventional machin...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7236453/ https://www.ncbi.nlm.nih.gov/pubmed/32429941 http://dx.doi.org/10.1186/s12859-020-03544-z |
_version_ | 1783536158449336320 |
---|---|
author | Zhao, Yiru Zhou, Yifan Liu, Yuan Hao, Yinyi Li, Menglong Pu, Xuemei Li, Chuan Wen, Zhining |
author_facet | Zhao, Yiru Zhou, Yifan Liu, Yuan Hao, Yinyi Li, Menglong Pu, Xuemei Li, Chuan Wen, Zhining |
author_sort | Zhao, Yiru |
collection | PubMed |
description | BACKGROUND: The aim of gene expression-based clinical modelling in tumorigenesis is not only to accurately predict the clinical endpoints, but also to reveal the genome characteristics for downstream analysis for the purpose of understanding the mechanisms of cancers. Most of the conventional machine learning methods involved a gene filtering step, in which tens of thousands of genes were firstly filtered based on the gene expression levels by a statistical method with an arbitrary cutoff. Although gene filtering procedure helps to reduce the feature dimension and avoid overfitting, there is a risk that some pathogenic genes important to the disease will be ignored. RESULTS: In this study, we proposed a novel deep learning approach by combining a convolutional neural network with stationary wavelet transform (SWT-CNN) for stratifying cancer patients and predicting their clinical outcomes without gene filtering based on tumor genomic profiles. The proposed SWT-CNN overperformed the state-of-art algorithms, including support vector machine (SVM) and logistic regression (LR), and produced comparable prediction performance to random forest (RF). Furthermore, for all the cancer types, we firstly proposed a method to weight the genes with the scores, which took advantage of the representative features in the hidden layer of convolutional neural network, and then selected the prognostic genes for the Cox proportional-hazards regression. The results showed that risk stratifications can be effectively improved by using the identified prognostic genes as feature, indicating that the representative features generated by SWT-CNN can well correlate the genes with prognostic risk in cancers and be helpful for selecting the prognostic gene signatures. CONCLUSIONS: Our results indicated that gene expression-based SWT-CNN model can be an excellent tool for stratifying the prognostic risk for cancer patients. In addition, the representative features of SWT-CNN were validated to be useful for evaluating the importance of the genes in the risk stratification and can be further used to identify the prognostic gene signatures. |
format | Online Article Text |
id | pubmed-7236453 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-72364532020-05-29 Uncovering the prognostic gene signatures for the improvement of risk stratification in cancers by using deep learning algorithm coupled with wavelet transform Zhao, Yiru Zhou, Yifan Liu, Yuan Hao, Yinyi Li, Menglong Pu, Xuemei Li, Chuan Wen, Zhining BMC Bioinformatics Methodology Article BACKGROUND: The aim of gene expression-based clinical modelling in tumorigenesis is not only to accurately predict the clinical endpoints, but also to reveal the genome characteristics for downstream analysis for the purpose of understanding the mechanisms of cancers. Most of the conventional machine learning methods involved a gene filtering step, in which tens of thousands of genes were firstly filtered based on the gene expression levels by a statistical method with an arbitrary cutoff. Although gene filtering procedure helps to reduce the feature dimension and avoid overfitting, there is a risk that some pathogenic genes important to the disease will be ignored. RESULTS: In this study, we proposed a novel deep learning approach by combining a convolutional neural network with stationary wavelet transform (SWT-CNN) for stratifying cancer patients and predicting their clinical outcomes without gene filtering based on tumor genomic profiles. The proposed SWT-CNN overperformed the state-of-art algorithms, including support vector machine (SVM) and logistic regression (LR), and produced comparable prediction performance to random forest (RF). Furthermore, for all the cancer types, we firstly proposed a method to weight the genes with the scores, which took advantage of the representative features in the hidden layer of convolutional neural network, and then selected the prognostic genes for the Cox proportional-hazards regression. The results showed that risk stratifications can be effectively improved by using the identified prognostic genes as feature, indicating that the representative features generated by SWT-CNN can well correlate the genes with prognostic risk in cancers and be helpful for selecting the prognostic gene signatures. CONCLUSIONS: Our results indicated that gene expression-based SWT-CNN model can be an excellent tool for stratifying the prognostic risk for cancer patients. In addition, the representative features of SWT-CNN were validated to be useful for evaluating the importance of the genes in the risk stratification and can be further used to identify the prognostic gene signatures. BioMed Central 2020-05-19 /pmc/articles/PMC7236453/ /pubmed/32429941 http://dx.doi.org/10.1186/s12859-020-03544-z Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Methodology Article Zhao, Yiru Zhou, Yifan Liu, Yuan Hao, Yinyi Li, Menglong Pu, Xuemei Li, Chuan Wen, Zhining Uncovering the prognostic gene signatures for the improvement of risk stratification in cancers by using deep learning algorithm coupled with wavelet transform |
title | Uncovering the prognostic gene signatures for the improvement of risk stratification in cancers by using deep learning algorithm coupled with wavelet transform |
title_full | Uncovering the prognostic gene signatures for the improvement of risk stratification in cancers by using deep learning algorithm coupled with wavelet transform |
title_fullStr | Uncovering the prognostic gene signatures for the improvement of risk stratification in cancers by using deep learning algorithm coupled with wavelet transform |
title_full_unstemmed | Uncovering the prognostic gene signatures for the improvement of risk stratification in cancers by using deep learning algorithm coupled with wavelet transform |
title_short | Uncovering the prognostic gene signatures for the improvement of risk stratification in cancers by using deep learning algorithm coupled with wavelet transform |
title_sort | uncovering the prognostic gene signatures for the improvement of risk stratification in cancers by using deep learning algorithm coupled with wavelet transform |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7236453/ https://www.ncbi.nlm.nih.gov/pubmed/32429941 http://dx.doi.org/10.1186/s12859-020-03544-z |
work_keys_str_mv | AT zhaoyiru uncoveringtheprognosticgenesignaturesfortheimprovementofriskstratificationincancersbyusingdeeplearningalgorithmcoupledwithwavelettransform AT zhouyifan uncoveringtheprognosticgenesignaturesfortheimprovementofriskstratificationincancersbyusingdeeplearningalgorithmcoupledwithwavelettransform AT liuyuan uncoveringtheprognosticgenesignaturesfortheimprovementofriskstratificationincancersbyusingdeeplearningalgorithmcoupledwithwavelettransform AT haoyinyi uncoveringtheprognosticgenesignaturesfortheimprovementofriskstratificationincancersbyusingdeeplearningalgorithmcoupledwithwavelettransform AT limenglong uncoveringtheprognosticgenesignaturesfortheimprovementofriskstratificationincancersbyusingdeeplearningalgorithmcoupledwithwavelettransform AT puxuemei uncoveringtheprognosticgenesignaturesfortheimprovementofriskstratificationincancersbyusingdeeplearningalgorithmcoupledwithwavelettransform AT lichuan uncoveringtheprognosticgenesignaturesfortheimprovementofriskstratificationincancersbyusingdeeplearningalgorithmcoupledwithwavelettransform AT wenzhining uncoveringtheprognosticgenesignaturesfortheimprovementofriskstratificationincancersbyusingdeeplearningalgorithmcoupledwithwavelettransform |