Cargando…

Detecting disease associated modules and prioritizing active genes based on high throughput data

BACKGROUND: The accumulation of high-throughput data greatly promotes computational investigation of gene function in the context of complex biological systems. However, a biological function is not simply controlled by an individual gene since genes function in a cooperative manner to achieve biolo...

Descripción completa

Detalles Bibliográficos
Autores principales: Qiu, Yu-Qing, Zhang, Shihua, Zhang, Xiang-Sun, Chen, Luonan
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2825224/
https://www.ncbi.nlm.nih.gov/pubmed/20070902
http://dx.doi.org/10.1186/1471-2105-11-26
_version_ 1782177796528799744
author Qiu, Yu-Qing
Zhang, Shihua
Zhang, Xiang-Sun
Chen, Luonan
author_facet Qiu, Yu-Qing
Zhang, Shihua
Zhang, Xiang-Sun
Chen, Luonan
author_sort Qiu, Yu-Qing
collection PubMed
description BACKGROUND: The accumulation of high-throughput data greatly promotes computational investigation of gene function in the context of complex biological systems. However, a biological function is not simply controlled by an individual gene since genes function in a cooperative manner to achieve biological processes. In the study of human diseases, rather than to discover disease related genes, identifying disease associated pathways and modules becomes an essential problem in the field of systems biology. RESULTS: In this paper, we propose a novel method to detect disease related gene modules or dysfunctional pathways based on global characteristics of interactome coupled with gene expression data. Specifically, we exploit interacting relationships between genes to define a gene's active score function based on the kernel trick, which can represent nonlinear effects of gene cooperativity. Then, modules or pathways are inferred based on the active scores evaluated by the support vector regression in a global and integrative manner. The efficiency and robustness of the proposed method are comprehensively validated by using both simulated and real data with the comparison to existing methods. CONCLUSIONS: By applying the proposed method to two cancer related problems, i.e. breast cancer and prostate cancer, we successfully identified active modules or dysfunctional pathways related to these two types of cancers with literature confirmed evidences. We show that this network-based method is highly efficient and can be applied to a large-scale problem especially for human disease related modules or pathway extraction. Moreover, this method can also be used for prioritizing genes associated with a specific phenotype or disease.
format Text
id pubmed-2825224
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28252242010-02-20 Detecting disease associated modules and prioritizing active genes based on high throughput data Qiu, Yu-Qing Zhang, Shihua Zhang, Xiang-Sun Chen, Luonan BMC Bioinformatics Research article BACKGROUND: The accumulation of high-throughput data greatly promotes computational investigation of gene function in the context of complex biological systems. However, a biological function is not simply controlled by an individual gene since genes function in a cooperative manner to achieve biological processes. In the study of human diseases, rather than to discover disease related genes, identifying disease associated pathways and modules becomes an essential problem in the field of systems biology. RESULTS: In this paper, we propose a novel method to detect disease related gene modules or dysfunctional pathways based on global characteristics of interactome coupled with gene expression data. Specifically, we exploit interacting relationships between genes to define a gene's active score function based on the kernel trick, which can represent nonlinear effects of gene cooperativity. Then, modules or pathways are inferred based on the active scores evaluated by the support vector regression in a global and integrative manner. The efficiency and robustness of the proposed method are comprehensively validated by using both simulated and real data with the comparison to existing methods. CONCLUSIONS: By applying the proposed method to two cancer related problems, i.e. breast cancer and prostate cancer, we successfully identified active modules or dysfunctional pathways related to these two types of cancers with literature confirmed evidences. We show that this network-based method is highly efficient and can be applied to a large-scale problem especially for human disease related modules or pathway extraction. Moreover, this method can also be used for prioritizing genes associated with a specific phenotype or disease. BioMed Central 2010-01-13 /pmc/articles/PMC2825224/ /pubmed/20070902 http://dx.doi.org/10.1186/1471-2105-11-26 Text en Copyright ©2010 Qiu et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research article
Qiu, Yu-Qing
Zhang, Shihua
Zhang, Xiang-Sun
Chen, Luonan
Detecting disease associated modules and prioritizing active genes based on high throughput data
title Detecting disease associated modules and prioritizing active genes based on high throughput data
title_full Detecting disease associated modules and prioritizing active genes based on high throughput data
title_fullStr Detecting disease associated modules and prioritizing active genes based on high throughput data
title_full_unstemmed Detecting disease associated modules and prioritizing active genes based on high throughput data
title_short Detecting disease associated modules and prioritizing active genes based on high throughput data
title_sort detecting disease associated modules and prioritizing active genes based on high throughput data
topic Research article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2825224/
https://www.ncbi.nlm.nih.gov/pubmed/20070902
http://dx.doi.org/10.1186/1471-2105-11-26
work_keys_str_mv AT qiuyuqing detectingdiseaseassociatedmodulesandprioritizingactivegenesbasedonhighthroughputdata
AT zhangshihua detectingdiseaseassociatedmodulesandprioritizingactivegenesbasedonhighthroughputdata
AT zhangxiangsun detectingdiseaseassociatedmodulesandprioritizingactivegenesbasedonhighthroughputdata
AT chenluonan detectingdiseaseassociatedmodulesandprioritizingactivegenesbasedonhighthroughputdata