Cargando…

Group Lasso Regularized Deep Learning for Cancer Prognosis from Multi-Omics and Clinical Features

Accurate prognosis of patients with cancer is important for the stratification of patients, the optimization of treatment strategies, and the design of clinical trials. Both clinical features and molecular data can be used for this purpose, for instance, to predict the survival of patients censored...

Descripción completa

Detalles Bibliográficos
Autores principales: Xie, Gangcai, Dong, Chengliang, Kong, Yinfei, Zhong, Jiang F., Li, Mingyao, Wang, Kai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6471789/
https://www.ncbi.nlm.nih.gov/pubmed/30901858
http://dx.doi.org/10.3390/genes10030240
_version_ 1783412105082306560
author Xie, Gangcai
Dong, Chengliang
Kong, Yinfei
Zhong, Jiang F.
Li, Mingyao
Wang, Kai
author_facet Xie, Gangcai
Dong, Chengliang
Kong, Yinfei
Zhong, Jiang F.
Li, Mingyao
Wang, Kai
author_sort Xie, Gangcai
collection PubMed
description Accurate prognosis of patients with cancer is important for the stratification of patients, the optimization of treatment strategies, and the design of clinical trials. Both clinical features and molecular data can be used for this purpose, for instance, to predict the survival of patients censored at specific time points. Multi-omics data, including genome-wide gene expression, methylation, protein expression, copy number alteration, and somatic mutation data, are becoming increasingly common in cancer studies. To harness the rich information in multi-omics data, we developed GDP (Group lass regularized Deep learning for cancer Prognosis), a computational tool for survival prediction using both clinical and multi-omics data. GDP integrated a deep learning framework and Cox proportional hazard model (CPH) together, and applied group lasso regularization to incorporate gene-level group prior knowledge into the model training process. We evaluated its performance in both simulated and real data from The Cancer Genome Atlas (TCGA) project. In simulated data, our results supported the importance of group prior information in the regularization of the model. Compared to the standard lasso regularization, we showed that group lasso achieved higher prediction accuracy when the group prior knowledge was provided. We also found that GDP performed better than CPH for complex survival data. Furthermore, analysis on real data demonstrated that GDP performed favorably against other methods in several cancers with large-scale omics data sets, such as glioblastoma multiforme, kidney renal clear cell carcinoma, and bladder urothelial carcinoma. In summary, we demonstrated that GDP is a powerful tool for prognosis of patients with cancer, especially when large-scale molecular features are available.
format Online
Article
Text
id pubmed-6471789
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-64717892019-04-27 Group Lasso Regularized Deep Learning for Cancer Prognosis from Multi-Omics and Clinical Features Xie, Gangcai Dong, Chengliang Kong, Yinfei Zhong, Jiang F. Li, Mingyao Wang, Kai Genes (Basel) Article Accurate prognosis of patients with cancer is important for the stratification of patients, the optimization of treatment strategies, and the design of clinical trials. Both clinical features and molecular data can be used for this purpose, for instance, to predict the survival of patients censored at specific time points. Multi-omics data, including genome-wide gene expression, methylation, protein expression, copy number alteration, and somatic mutation data, are becoming increasingly common in cancer studies. To harness the rich information in multi-omics data, we developed GDP (Group lass regularized Deep learning for cancer Prognosis), a computational tool for survival prediction using both clinical and multi-omics data. GDP integrated a deep learning framework and Cox proportional hazard model (CPH) together, and applied group lasso regularization to incorporate gene-level group prior knowledge into the model training process. We evaluated its performance in both simulated and real data from The Cancer Genome Atlas (TCGA) project. In simulated data, our results supported the importance of group prior information in the regularization of the model. Compared to the standard lasso regularization, we showed that group lasso achieved higher prediction accuracy when the group prior knowledge was provided. We also found that GDP performed better than CPH for complex survival data. Furthermore, analysis on real data demonstrated that GDP performed favorably against other methods in several cancers with large-scale omics data sets, such as glioblastoma multiforme, kidney renal clear cell carcinoma, and bladder urothelial carcinoma. In summary, we demonstrated that GDP is a powerful tool for prognosis of patients with cancer, especially when large-scale molecular features are available. MDPI 2019-03-21 /pmc/articles/PMC6471789/ /pubmed/30901858 http://dx.doi.org/10.3390/genes10030240 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Xie, Gangcai
Dong, Chengliang
Kong, Yinfei
Zhong, Jiang F.
Li, Mingyao
Wang, Kai
Group Lasso Regularized Deep Learning for Cancer Prognosis from Multi-Omics and Clinical Features
title Group Lasso Regularized Deep Learning for Cancer Prognosis from Multi-Omics and Clinical Features
title_full Group Lasso Regularized Deep Learning for Cancer Prognosis from Multi-Omics and Clinical Features
title_fullStr Group Lasso Regularized Deep Learning for Cancer Prognosis from Multi-Omics and Clinical Features
title_full_unstemmed Group Lasso Regularized Deep Learning for Cancer Prognosis from Multi-Omics and Clinical Features
title_short Group Lasso Regularized Deep Learning for Cancer Prognosis from Multi-Omics and Clinical Features
title_sort group lasso regularized deep learning for cancer prognosis from multi-omics and clinical features
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6471789/
https://www.ncbi.nlm.nih.gov/pubmed/30901858
http://dx.doi.org/10.3390/genes10030240
work_keys_str_mv AT xiegangcai grouplassoregularizeddeeplearningforcancerprognosisfrommultiomicsandclinicalfeatures
AT dongchengliang grouplassoregularizeddeeplearningforcancerprognosisfrommultiomicsandclinicalfeatures
AT kongyinfei grouplassoregularizeddeeplearningforcancerprognosisfrommultiomicsandclinicalfeatures
AT zhongjiangf grouplassoregularizeddeeplearningforcancerprognosisfrommultiomicsandclinicalfeatures
AT limingyao grouplassoregularizeddeeplearningforcancerprognosisfrommultiomicsandclinicalfeatures
AT wangkai grouplassoregularizeddeeplearningforcancerprognosisfrommultiomicsandclinicalfeatures