Cargando…
Integration of gene interaction information into a reweighted random survival forest approach for accurate survival prediction and survival biomarker discovery
Accurately predicting patient risk and identifying survival biomarkers are two important tasks in survival analysis. For the emerging high-throughput gene expression data, random survival forest (RSF) is attracting more and more attention as it not only shows excellent performance on survival predic...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6123437/ https://www.ncbi.nlm.nih.gov/pubmed/30181543 http://dx.doi.org/10.1038/s41598-018-31497-0 |
_version_ | 1783352838816006144 |
---|---|
author | Wang, Wei Liu, Wei |
author_facet | Wang, Wei Liu, Wei |
author_sort | Wang, Wei |
collection | PubMed |
description | Accurately predicting patient risk and identifying survival biomarkers are two important tasks in survival analysis. For the emerging high-throughput gene expression data, random survival forest (RSF) is attracting more and more attention as it not only shows excellent performance on survival prediction problems with high-dimensional variables, but also is capable of identifying important variables according to variable importance automatically calculated within the algorithm. However, RSF still suffers from some problems such as limited predictive accuracy on independent datasets and limited biological interpretation of survival biomarkers. In this study, we integrated gene interaction information into a Reweighted RSF model (RRSF) to improve predictive accuracy and identify biologically meaningful survival markers. We applied RRSF to the prediction of patients with glioblastoma multiforme (GBM) and esophageal squamous cell carcinoma (ESCC). With a reconstructed global pathway network and an mRNA-lncRNA co-expression network as the prior gene interaction information, RRSF showed better overall predictive performance than RSF on three GBM and two ESCC datasets. In addition, RRSF identified a two-gene and three-lncRNA signature, which showed robust prognostic values and had high biological relevance to the development of GBM and ESCC, respectively. |
format | Online Article Text |
id | pubmed-6123437 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-61234372018-09-10 Integration of gene interaction information into a reweighted random survival forest approach for accurate survival prediction and survival biomarker discovery Wang, Wei Liu, Wei Sci Rep Article Accurately predicting patient risk and identifying survival biomarkers are two important tasks in survival analysis. For the emerging high-throughput gene expression data, random survival forest (RSF) is attracting more and more attention as it not only shows excellent performance on survival prediction problems with high-dimensional variables, but also is capable of identifying important variables according to variable importance automatically calculated within the algorithm. However, RSF still suffers from some problems such as limited predictive accuracy on independent datasets and limited biological interpretation of survival biomarkers. In this study, we integrated gene interaction information into a Reweighted RSF model (RRSF) to improve predictive accuracy and identify biologically meaningful survival markers. We applied RRSF to the prediction of patients with glioblastoma multiforme (GBM) and esophageal squamous cell carcinoma (ESCC). With a reconstructed global pathway network and an mRNA-lncRNA co-expression network as the prior gene interaction information, RRSF showed better overall predictive performance than RSF on three GBM and two ESCC datasets. In addition, RRSF identified a two-gene and three-lncRNA signature, which showed robust prognostic values and had high biological relevance to the development of GBM and ESCC, respectively. Nature Publishing Group UK 2018-09-04 /pmc/articles/PMC6123437/ /pubmed/30181543 http://dx.doi.org/10.1038/s41598-018-31497-0 Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Wang, Wei Liu, Wei Integration of gene interaction information into a reweighted random survival forest approach for accurate survival prediction and survival biomarker discovery |
title | Integration of gene interaction information into a reweighted random survival forest approach for accurate survival prediction and survival biomarker discovery |
title_full | Integration of gene interaction information into a reweighted random survival forest approach for accurate survival prediction and survival biomarker discovery |
title_fullStr | Integration of gene interaction information into a reweighted random survival forest approach for accurate survival prediction and survival biomarker discovery |
title_full_unstemmed | Integration of gene interaction information into a reweighted random survival forest approach for accurate survival prediction and survival biomarker discovery |
title_short | Integration of gene interaction information into a reweighted random survival forest approach for accurate survival prediction and survival biomarker discovery |
title_sort | integration of gene interaction information into a reweighted random survival forest approach for accurate survival prediction and survival biomarker discovery |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6123437/ https://www.ncbi.nlm.nih.gov/pubmed/30181543 http://dx.doi.org/10.1038/s41598-018-31497-0 |
work_keys_str_mv | AT wangwei integrationofgeneinteractioninformationintoareweightedrandomsurvivalforestapproachforaccuratesurvivalpredictionandsurvivalbiomarkerdiscovery AT liuwei integrationofgeneinteractioninformationintoareweightedrandomsurvivalforestapproachforaccuratesurvivalpredictionandsurvivalbiomarkerdiscovery |