Cargando…

Integration of gene interaction information into a reweighted random survival forest approach for accurate survival prediction and survival biomarker discovery

Accurately predicting patient risk and identifying survival biomarkers are two important tasks in survival analysis. For the emerging high-throughput gene expression data, random survival forest (RSF) is attracting more and more attention as it not only shows excellent performance on survival predic...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Wei, Liu, Wei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6123437/
https://www.ncbi.nlm.nih.gov/pubmed/30181543
http://dx.doi.org/10.1038/s41598-018-31497-0
_version_ 1783352838816006144
author Wang, Wei
Liu, Wei
author_facet Wang, Wei
Liu, Wei
author_sort Wang, Wei
collection PubMed
description Accurately predicting patient risk and identifying survival biomarkers are two important tasks in survival analysis. For the emerging high-throughput gene expression data, random survival forest (RSF) is attracting more and more attention as it not only shows excellent performance on survival prediction problems with high-dimensional variables, but also is capable of identifying important variables according to variable importance automatically calculated within the algorithm. However, RSF still suffers from some problems such as limited predictive accuracy on independent datasets and limited biological interpretation of survival biomarkers. In this study, we integrated gene interaction information into a Reweighted RSF model (RRSF) to improve predictive accuracy and identify biologically meaningful survival markers. We applied RRSF to the prediction of patients with glioblastoma multiforme (GBM) and esophageal squamous cell carcinoma (ESCC). With a reconstructed global pathway network and an mRNA-lncRNA co-expression network as the prior gene interaction information, RRSF showed better overall predictive performance than RSF on three GBM and two ESCC datasets. In addition, RRSF identified a two-gene and three-lncRNA signature, which showed robust prognostic values and had high biological relevance to the development of GBM and ESCC, respectively.
format Online
Article
Text
id pubmed-6123437
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-61234372018-09-10 Integration of gene interaction information into a reweighted random survival forest approach for accurate survival prediction and survival biomarker discovery Wang, Wei Liu, Wei Sci Rep Article Accurately predicting patient risk and identifying survival biomarkers are two important tasks in survival analysis. For the emerging high-throughput gene expression data, random survival forest (RSF) is attracting more and more attention as it not only shows excellent performance on survival prediction problems with high-dimensional variables, but also is capable of identifying important variables according to variable importance automatically calculated within the algorithm. However, RSF still suffers from some problems such as limited predictive accuracy on independent datasets and limited biological interpretation of survival biomarkers. In this study, we integrated gene interaction information into a Reweighted RSF model (RRSF) to improve predictive accuracy and identify biologically meaningful survival markers. We applied RRSF to the prediction of patients with glioblastoma multiforme (GBM) and esophageal squamous cell carcinoma (ESCC). With a reconstructed global pathway network and an mRNA-lncRNA co-expression network as the prior gene interaction information, RRSF showed better overall predictive performance than RSF on three GBM and two ESCC datasets. In addition, RRSF identified a two-gene and three-lncRNA signature, which showed robust prognostic values and had high biological relevance to the development of GBM and ESCC, respectively. Nature Publishing Group UK 2018-09-04 /pmc/articles/PMC6123437/ /pubmed/30181543 http://dx.doi.org/10.1038/s41598-018-31497-0 Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Wang, Wei
Liu, Wei
Integration of gene interaction information into a reweighted random survival forest approach for accurate survival prediction and survival biomarker discovery
title Integration of gene interaction information into a reweighted random survival forest approach for accurate survival prediction and survival biomarker discovery
title_full Integration of gene interaction information into a reweighted random survival forest approach for accurate survival prediction and survival biomarker discovery
title_fullStr Integration of gene interaction information into a reweighted random survival forest approach for accurate survival prediction and survival biomarker discovery
title_full_unstemmed Integration of gene interaction information into a reweighted random survival forest approach for accurate survival prediction and survival biomarker discovery
title_short Integration of gene interaction information into a reweighted random survival forest approach for accurate survival prediction and survival biomarker discovery
title_sort integration of gene interaction information into a reweighted random survival forest approach for accurate survival prediction and survival biomarker discovery
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6123437/
https://www.ncbi.nlm.nih.gov/pubmed/30181543
http://dx.doi.org/10.1038/s41598-018-31497-0
work_keys_str_mv AT wangwei integrationofgeneinteractioninformationintoareweightedrandomsurvivalforestapproachforaccuratesurvivalpredictionandsurvivalbiomarkerdiscovery
AT liuwei integrationofgeneinteractioninformationintoareweightedrandomsurvivalforestapproachforaccuratesurvivalpredictionandsurvivalbiomarkerdiscovery