Cargando…

A copula method for modeling directional dependence of genes

BACKGROUND: Genes interact with each other as basic building blocks of life, forming a complicated network. The relationship between groups of genes with different functions can be represented as gene networks. With the deposition of huge microarray data sets in public domains, study on gene network...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Jong-Min, Jung, Yoon-Sung, Sungur, Engin A, Han, Kap-Hoon, Park, Changyi, Sohn, Insuk
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2386493/
https://www.ncbi.nlm.nih.gov/pubmed/18447957
http://dx.doi.org/10.1186/1471-2105-9-225
_version_ 1782155244898091008
author Kim, Jong-Min
Jung, Yoon-Sung
Sungur, Engin A
Han, Kap-Hoon
Park, Changyi
Sohn, Insuk
author_facet Kim, Jong-Min
Jung, Yoon-Sung
Sungur, Engin A
Han, Kap-Hoon
Park, Changyi
Sohn, Insuk
author_sort Kim, Jong-Min
collection PubMed
description BACKGROUND: Genes interact with each other as basic building blocks of life, forming a complicated network. The relationship between groups of genes with different functions can be represented as gene networks. With the deposition of huge microarray data sets in public domains, study on gene networking is now possible. In recent years, there has been an increasing interest in the reconstruction of gene networks from gene expression data. Recent work includes linear models, Boolean network models, and Bayesian networks. Among them, Bayesian networks seem to be the most effective in constructing gene networks. A major problem with the Bayesian network approach is the excessive computational time. This problem is due to the interactive feature of the method that requires large search space. Since fitting a model by using the copulas does not require iterations, elicitation of the priors, and complicated calculations of posterior distributions, the need for reference to extensive search spaces can be eliminated leading to manageable computational affords. Bayesian network approach produces a discretely expression of conditional probabilities. Discreteness of the characteristics is not required in the copula approach which involves use of uniform representation of the continuous random variables. Our method is able to overcome the limitation of Bayesian network method for gene-gene interaction, i.e. information loss due to binary transformation. RESULTS: We analyzed the gene interactions for two gene data sets (one group is eight histone genes and the other group is 19 genes which include DNA polymerases, DNA helicase, type B cyclin genes, DNA primases, radiation sensitive genes, repaire related genes, replication protein A encoding gene, DNA replication initiation factor, securin gene, nucleosome assembly factor, and a subunit of the cohesin complex) by adopting a measure of directional dependence based on a copula function. We have compared our results with those from other methods in the literature. Although microarray results show a transcriptional co-regulation pattern and do not imply that the gene products are physically interactive, this tight genetic connection may suggest that each gene product has either direct or indirect connections between the other gene products. Indeed, recent comprehensive analysis of a protein interaction map revealed that those histone genes are physically connected with each other, supporting the results obtained by our method. CONCLUSION: The results illustrate that our method can be an alternative to Bayesian networks in modeling gene interactions. One advantage of our approach is that dependence between genes is not assumed to be linear. Another advantage is that our approach can detect directional dependence. We expect that our study may help to design artificial drug candidates, which can block or activate biologically meaningful pathways. Moreover, our copula approach can be extended to investigate the effects of local environments on protein-protein interactions. The copula mutual information approach will help to propose the new variant of ARACNE (Algorithm for the Reconstruction of Accurate Cellular Networks): an algorithm for the reconstruction of gene regulatory networks.
format Text
id pubmed-2386493
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-23864932008-05-16 A copula method for modeling directional dependence of genes Kim, Jong-Min Jung, Yoon-Sung Sungur, Engin A Han, Kap-Hoon Park, Changyi Sohn, Insuk BMC Bioinformatics Methodology Article BACKGROUND: Genes interact with each other as basic building blocks of life, forming a complicated network. The relationship between groups of genes with different functions can be represented as gene networks. With the deposition of huge microarray data sets in public domains, study on gene networking is now possible. In recent years, there has been an increasing interest in the reconstruction of gene networks from gene expression data. Recent work includes linear models, Boolean network models, and Bayesian networks. Among them, Bayesian networks seem to be the most effective in constructing gene networks. A major problem with the Bayesian network approach is the excessive computational time. This problem is due to the interactive feature of the method that requires large search space. Since fitting a model by using the copulas does not require iterations, elicitation of the priors, and complicated calculations of posterior distributions, the need for reference to extensive search spaces can be eliminated leading to manageable computational affords. Bayesian network approach produces a discretely expression of conditional probabilities. Discreteness of the characteristics is not required in the copula approach which involves use of uniform representation of the continuous random variables. Our method is able to overcome the limitation of Bayesian network method for gene-gene interaction, i.e. information loss due to binary transformation. RESULTS: We analyzed the gene interactions for two gene data sets (one group is eight histone genes and the other group is 19 genes which include DNA polymerases, DNA helicase, type B cyclin genes, DNA primases, radiation sensitive genes, repaire related genes, replication protein A encoding gene, DNA replication initiation factor, securin gene, nucleosome assembly factor, and a subunit of the cohesin complex) by adopting a measure of directional dependence based on a copula function. We have compared our results with those from other methods in the literature. Although microarray results show a transcriptional co-regulation pattern and do not imply that the gene products are physically interactive, this tight genetic connection may suggest that each gene product has either direct or indirect connections between the other gene products. Indeed, recent comprehensive analysis of a protein interaction map revealed that those histone genes are physically connected with each other, supporting the results obtained by our method. CONCLUSION: The results illustrate that our method can be an alternative to Bayesian networks in modeling gene interactions. One advantage of our approach is that dependence between genes is not assumed to be linear. Another advantage is that our approach can detect directional dependence. We expect that our study may help to design artificial drug candidates, which can block or activate biologically meaningful pathways. Moreover, our copula approach can be extended to investigate the effects of local environments on protein-protein interactions. The copula mutual information approach will help to propose the new variant of ARACNE (Algorithm for the Reconstruction of Accurate Cellular Networks): an algorithm for the reconstruction of gene regulatory networks. BioMed Central 2008-05-01 /pmc/articles/PMC2386493/ /pubmed/18447957 http://dx.doi.org/10.1186/1471-2105-9-225 Text en Copyright © 2008 Kim et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Kim, Jong-Min
Jung, Yoon-Sung
Sungur, Engin A
Han, Kap-Hoon
Park, Changyi
Sohn, Insuk
A copula method for modeling directional dependence of genes
title A copula method for modeling directional dependence of genes
title_full A copula method for modeling directional dependence of genes
title_fullStr A copula method for modeling directional dependence of genes
title_full_unstemmed A copula method for modeling directional dependence of genes
title_short A copula method for modeling directional dependence of genes
title_sort copula method for modeling directional dependence of genes
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2386493/
https://www.ncbi.nlm.nih.gov/pubmed/18447957
http://dx.doi.org/10.1186/1471-2105-9-225
work_keys_str_mv AT kimjongmin acopulamethodformodelingdirectionaldependenceofgenes
AT jungyoonsung acopulamethodformodelingdirectionaldependenceofgenes
AT sungurengina acopulamethodformodelingdirectionaldependenceofgenes
AT hankaphoon acopulamethodformodelingdirectionaldependenceofgenes
AT parkchangyi acopulamethodformodelingdirectionaldependenceofgenes
AT sohninsuk acopulamethodformodelingdirectionaldependenceofgenes
AT kimjongmin copulamethodformodelingdirectionaldependenceofgenes
AT jungyoonsung copulamethodformodelingdirectionaldependenceofgenes
AT sungurengina copulamethodformodelingdirectionaldependenceofgenes
AT hankaphoon copulamethodformodelingdirectionaldependenceofgenes
AT parkchangyi copulamethodformodelingdirectionaldependenceofgenes
AT sohninsuk copulamethodformodelingdirectionaldependenceofgenes