Cargando…

Incorporating higher-order representative features improves prediction in network-based cancer prognosis analysis

BACKGROUND: In cancer prognosis studies with gene expression measurements, an important goal is to construct gene signatures with predictive power. In this study, we describe the coordination among genes using the weighted coexpression network, where nodes represent genes and nodes are connected if...

Descripción completa

Detalles Bibliográficos
Autores principales: Ma, Shuangge, Kosorok, Michael R, Huang, Jian, Dai, Ying
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3037289/
https://www.ncbi.nlm.nih.gov/pubmed/21226928
http://dx.doi.org/10.1186/1755-8794-4-5
_version_ 1782197964722143232
author Ma, Shuangge
Kosorok, Michael R
Huang, Jian
Dai, Ying
author_facet Ma, Shuangge
Kosorok, Michael R
Huang, Jian
Dai, Ying
author_sort Ma, Shuangge
collection PubMed
description BACKGROUND: In cancer prognosis studies with gene expression measurements, an important goal is to construct gene signatures with predictive power. In this study, we describe the coordination among genes using the weighted coexpression network, where nodes represent genes and nodes are connected if the corresponding genes have similar expression patterns across samples. There are subsets of nodes, called modules, that are tightly connected to each other. In several published studies, it has been suggested that the first principal components of individual modules, also referred to as "eigengenes", may sufficiently represent the corresponding modules. RESULTS: In this article, we refer to principal components and their functions as representative features". We investigate higher-order representative features, which include the principal components other than the first ones and second order terms (quadratics and interactions). Two gradient thresholding methods are adopted for regularized estimation and feature selection. Analysis of six prognosis studies on lymphoma and breast cancer shows that incorporating higher-order representative features improves prediction performance over using eigengenes only. Simulation study further shows that prediction performance can be less satisfactory if the representative feature set is not properly chosen. CONCLUSIONS: This study introduces multiple ways of defining the representative features and effective thresholding regularized estimation approaches. It provides convincing evidence that the higher-order representative features may have important implications for the prediction of cancer prognosis.
format Text
id pubmed-3037289
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-30372892011-02-18 Incorporating higher-order representative features improves prediction in network-based cancer prognosis analysis Ma, Shuangge Kosorok, Michael R Huang, Jian Dai, Ying BMC Med Genomics Research Article BACKGROUND: In cancer prognosis studies with gene expression measurements, an important goal is to construct gene signatures with predictive power. In this study, we describe the coordination among genes using the weighted coexpression network, where nodes represent genes and nodes are connected if the corresponding genes have similar expression patterns across samples. There are subsets of nodes, called modules, that are tightly connected to each other. In several published studies, it has been suggested that the first principal components of individual modules, also referred to as "eigengenes", may sufficiently represent the corresponding modules. RESULTS: In this article, we refer to principal components and their functions as representative features". We investigate higher-order representative features, which include the principal components other than the first ones and second order terms (quadratics and interactions). Two gradient thresholding methods are adopted for regularized estimation and feature selection. Analysis of six prognosis studies on lymphoma and breast cancer shows that incorporating higher-order representative features improves prediction performance over using eigengenes only. Simulation study further shows that prediction performance can be less satisfactory if the representative feature set is not properly chosen. CONCLUSIONS: This study introduces multiple ways of defining the representative features and effective thresholding regularized estimation approaches. It provides convincing evidence that the higher-order representative features may have important implications for the prediction of cancer prognosis. BioMed Central 2011-01-12 /pmc/articles/PMC3037289/ /pubmed/21226928 http://dx.doi.org/10.1186/1755-8794-4-5 Text en Copyright ©2011 Ma et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Ma, Shuangge
Kosorok, Michael R
Huang, Jian
Dai, Ying
Incorporating higher-order representative features improves prediction in network-based cancer prognosis analysis
title Incorporating higher-order representative features improves prediction in network-based cancer prognosis analysis
title_full Incorporating higher-order representative features improves prediction in network-based cancer prognosis analysis
title_fullStr Incorporating higher-order representative features improves prediction in network-based cancer prognosis analysis
title_full_unstemmed Incorporating higher-order representative features improves prediction in network-based cancer prognosis analysis
title_short Incorporating higher-order representative features improves prediction in network-based cancer prognosis analysis
title_sort incorporating higher-order representative features improves prediction in network-based cancer prognosis analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3037289/
https://www.ncbi.nlm.nih.gov/pubmed/21226928
http://dx.doi.org/10.1186/1755-8794-4-5
work_keys_str_mv AT mashuangge incorporatinghigherorderrepresentativefeaturesimprovespredictioninnetworkbasedcancerprognosisanalysis
AT kosorokmichaelr incorporatinghigherorderrepresentativefeaturesimprovespredictioninnetworkbasedcancerprognosisanalysis
AT huangjian incorporatinghigherorderrepresentativefeaturesimprovespredictioninnetworkbasedcancerprognosisanalysis
AT daiying incorporatinghigherorderrepresentativefeaturesimprovespredictioninnetworkbasedcancerprognosisanalysis