Inference of gene interaction networks using conserved subsequential patterns from multiple time course gene expression datasets
MOTIVATION: Deciphering gene interaction networks (GINs) from time-course gene expression (TCGx) data is highly valuable to understand gene behaviors (e.g., activation, inhibition, time-lagged causality) at the system level. Existing methods usually use a global or local proximity measure to infer G...
Main Authors: | Liu, Qian; Song, Renhua; Li, Jinyan |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | BioMed Central 2015 |
Subjects: | |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4682423/ https://www.ncbi.nlm.nih.gov/pubmed/26681650 http://dx.doi.org/10.1186/1471-2164-16-S12-S4 |
_version_ | 1782405887856476160 |
---|---|
author | Liu, Qian Song, Renhua Li, Jinyan |
author_sort | Liu, Qian |
collection | PubMed |
description | MOTIVATION: Deciphering gene interaction networks (GINs) from time-course gene expression (TCGx) data is highly valuable for understanding gene behaviors (e.g., activation, inhibition, time-lagged causality) at the system level. Existing methods usually use a global or local proximity measure to infer GINs from a single dataset. Because the noise in a single dataset can hardly be resolved from that dataset alone, the results are sometimes unreliable. Moreover, these proximity measures cannot handle the co-existence of the various in vivo positive, negative and time-lagged gene interactions. METHODS AND RESULTS: We propose to infer reliable GINs from multiple TCGx datasets using a novel conserved subsequential pattern of gene expression. A subsequential pattern is a maximal subset of genes sharing positive, negative or time-lagged correlations with one expression template on their own subsets of time points. Based on these patterns, a GIN can be built from each of the datasets. Reliable gene interactions are assumed to be detected repeatedly. We therefore use the gene pairs conserved across the individual GINs of the multiple TCGx datasets to construct a reliable GIN for a species. We apply our method to six TCGx datasets related to the yeast cell cycle, and validate the reliable GINs using protein interaction networks, biopathways and transcription factor-gene regulations. We also compare the reliable GINs with GINs reconstructed from single datasets by a global proximity measure, the Pearson correlation coefficient. Our reliable GINs achieve much better prediction performance, especially much higher precision. The functional enrichment analysis also suggests that gene sets in a reliable GIN are more functionally significant. Our method is especially useful for deciphering GINs from multiple TCGx datasets related to less studied organisms, where little knowledge is available except gene expression data. |
format | Online Article Text |
id | pubmed-4682423 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-4682423 2015-12-21 Inference of gene interaction networks using conserved subsequential patterns from multiple time course gene expression datasets Liu, Qian Song, Renhua Li, Jinyan BMC Genomics Research BioMed Central 2015-12-09 /pmc/articles/PMC4682423/ /pubmed/26681650 http://dx.doi.org/10.1186/1471-2164-16-S12-S4 Text en Copyright © 2015 Liu et al. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
title | Inference of gene interaction networks using conserved subsequential patterns from multiple time course gene expression datasets |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4682423/ https://www.ncbi.nlm.nih.gov/pubmed/26681650 http://dx.doi.org/10.1186/1471-2164-16-S12-S4 |
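
The description in this record says that a reliable GIN is built from gene pairs conserved across the individual GINs of several datasets, and that the result is judged by its precision against known interactions. The following Python sketch illustrates only that voting and scoring step. It is not the authors' implementation: the function names `conserved_pairs` and `precision`, the `min_support` threshold, and the placeholder gene names are all assumptions made for illustration, and the per-dataset networks are taken as given rather than derived from subsequential patterns.

```python
from collections import Counter


def conserved_pairs(networks, min_support):
    """Return gene pairs that occur in at least `min_support` networks.

    `networks` is a list of edge lists, one per dataset. Edges are treated
    as undirected, so (a, b) and (b, a) count as the same pair.
    `min_support` is a hypothetical threshold, not a value from the paper.
    """
    counts = Counter()
    for edges in networks:
        # Deduplicate within one network so each dataset casts one vote per pair;
        # self-loops are skipped.
        unique = {frozenset(edge) for edge in edges if edge[0] != edge[1]}
        counts.update(unique)
    return {pair for pair, n in counts.items() if n >= min_support}


def precision(predicted, reference):
    """Fraction of predicted pairs that also appear in the reference edges."""
    if not predicted:
        return 0.0
    ref = {frozenset(edge) for edge in reference}
    return len(predicted & ref) / len(predicted)


# Toy input with placeholder gene names (made-up data, not results from the paper).
networks = [
    [("g1", "g2"), ("g3", "g4"), ("g5", "g1")],
    [("g2", "g1"), ("g3", "g4")],
    [("g1", "g2"), ("g5", "g6")],
]
reliable = conserved_pairs(networks, min_support=2)

# Hypothetical reference set standing in for a protein interaction network.
reference = [("g1", "g2"), ("g5", "g6")]
score = precision(reliable, reference)
```

With this toy input, only the pairs g1/g2 and g3/g4 reach the support threshold, and one of those two is in the reference set, so `score` is 0.5. Raising `min_support` trades coverage for stricter agreement across datasets.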