Cargando…

Knowledge-guided gene ranking by coordinative component analysis

BACKGROUND: In cancer, gene networks and pathways often exhibit dynamic behavior, particularly during the process of carcinogenesis. Thus, it is important to prioritize those genes that are strongly associated with the functionality of a network. Traditional statistical methods are often inept to id...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Chen, Xuan, Jianhua, Li, Huai, Wang, Yue, Zhan, Ming, Hoffman, Eric P, Clarke, Robert
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2865494/
https://www.ncbi.nlm.nih.gov/pubmed/20353603
http://dx.doi.org/10.1186/1471-2105-11-162
_version_ 1782180843368742912
author Wang, Chen
Xuan, Jianhua
Li, Huai
Wang, Yue
Zhan, Ming
Hoffman, Eric P
Clarke, Robert
author_facet Wang, Chen
Xuan, Jianhua
Li, Huai
Wang, Yue
Zhan, Ming
Hoffman, Eric P
Clarke, Robert
author_sort Wang, Chen
collection PubMed
description BACKGROUND: In cancer, gene networks and pathways often exhibit dynamic behavior, particularly during the process of carcinogenesis. Thus, it is important to prioritize those genes that are strongly associated with the functionality of a network. Traditional statistical methods are often inept to identify biologically relevant member genes, motivating researchers to incorporate biological knowledge into gene ranking methods. However, current integration strategies are often heuristic and fail to incorporate fully the true interplay between biological knowledge and gene expression data. RESULTS: To improve knowledge-guided gene ranking, we propose a novel method called coordinative component analysis (COCA) in this paper. COCA explicitly captures those genes within a specific biological context that are likely to be expressed in a coordinative manner. Formulated as an optimization problem to maximize the coordinative effort, COCA is designed to first extract the coordinative components based on a partial guidance from knowledge genes and then rank the genes according to their participation strengths. An embedded bootstrapping procedure is implemented to improve statistical robustness of the solutions. COCA was initially tested on simulation data and then on published gene expression microarray data to demonstrate its improved performance as compared to traditional statistical methods. Finally, the COCA approach has been applied to stem cell data to identify biologically relevant genes in signaling pathways. As a result, the COCA approach uncovers novel pathway members that may shed light into the pathway deregulation in cancers. CONCLUSION: We have developed a new integrative strategy to combine biological knowledge and microarray data for gene ranking. The method utilizes knowledge genes for a guidance to first extract coordinative components, and then rank the genes according to their contribution related to a network or pathway. The experimental results show that such a knowledge-guided strategy can provide context-specific gene ranking with an improved performance in pathway member identification.
format Text
id pubmed-2865494
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28654942010-05-07 Knowledge-guided gene ranking by coordinative component analysis Wang, Chen Xuan, Jianhua Li, Huai Wang, Yue Zhan, Ming Hoffman, Eric P Clarke, Robert BMC Bioinformatics Methodology article BACKGROUND: In cancer, gene networks and pathways often exhibit dynamic behavior, particularly during the process of carcinogenesis. Thus, it is important to prioritize those genes that are strongly associated with the functionality of a network. Traditional statistical methods are often inept to identify biologically relevant member genes, motivating researchers to incorporate biological knowledge into gene ranking methods. However, current integration strategies are often heuristic and fail to incorporate fully the true interplay between biological knowledge and gene expression data. RESULTS: To improve knowledge-guided gene ranking, we propose a novel method called coordinative component analysis (COCA) in this paper. COCA explicitly captures those genes within a specific biological context that are likely to be expressed in a coordinative manner. Formulated as an optimization problem to maximize the coordinative effort, COCA is designed to first extract the coordinative components based on a partial guidance from knowledge genes and then rank the genes according to their participation strengths. An embedded bootstrapping procedure is implemented to improve statistical robustness of the solutions. COCA was initially tested on simulation data and then on published gene expression microarray data to demonstrate its improved performance as compared to traditional statistical methods. Finally, the COCA approach has been applied to stem cell data to identify biologically relevant genes in signaling pathways. As a result, the COCA approach uncovers novel pathway members that may shed light into the pathway deregulation in cancers. CONCLUSION: We have developed a new integrative strategy to combine biological knowledge and microarray data for gene ranking. The method utilizes knowledge genes for a guidance to first extract coordinative components, and then rank the genes according to their contribution related to a network or pathway. The experimental results show that such a knowledge-guided strategy can provide context-specific gene ranking with an improved performance in pathway member identification. BioMed Central 2010-03-30 /pmc/articles/PMC2865494/ /pubmed/20353603 http://dx.doi.org/10.1186/1471-2105-11-162 Text en Copyright ©2010 Wang et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology article
Wang, Chen
Xuan, Jianhua
Li, Huai
Wang, Yue
Zhan, Ming
Hoffman, Eric P
Clarke, Robert
Knowledge-guided gene ranking by coordinative component analysis
title Knowledge-guided gene ranking by coordinative component analysis
title_full Knowledge-guided gene ranking by coordinative component analysis
title_fullStr Knowledge-guided gene ranking by coordinative component analysis
title_full_unstemmed Knowledge-guided gene ranking by coordinative component analysis
title_short Knowledge-guided gene ranking by coordinative component analysis
title_sort knowledge-guided gene ranking by coordinative component analysis
topic Methodology article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2865494/
https://www.ncbi.nlm.nih.gov/pubmed/20353603
http://dx.doi.org/10.1186/1471-2105-11-162
work_keys_str_mv AT wangchen knowledgeguidedgenerankingbycoordinativecomponentanalysis
AT xuanjianhua knowledgeguidedgenerankingbycoordinativecomponentanalysis
AT lihuai knowledgeguidedgenerankingbycoordinativecomponentanalysis
AT wangyue knowledgeguidedgenerankingbycoordinativecomponentanalysis
AT zhanming knowledgeguidedgenerankingbycoordinativecomponentanalysis
AT hoffmanericp knowledgeguidedgenerankingbycoordinativecomponentanalysis
AT clarkerobert knowledgeguidedgenerankingbycoordinativecomponentanalysis