Cargando…
Joint analysis of differential gene expression in multiple studies using correlation motifs
The standard methods for detecting differential gene expression are mostly designed for analyzing a single gene expression experiment. When data from multiple related gene expression studies are available, separately analyzing each study is not ideal as it may fail to detect important genes with con...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4263229/ https://www.ncbi.nlm.nih.gov/pubmed/25143368 http://dx.doi.org/10.1093/biostatistics/kxu038 |
_version_ | 1782348534728622080 |
---|---|
author | Wei, Yingying Tenzen, Toyoaki Ji, Hongkai |
author_facet | Wei, Yingying Tenzen, Toyoaki Ji, Hongkai |
author_sort | Wei, Yingying |
collection | PubMed |
description | The standard methods for detecting differential gene expression are mostly designed for analyzing a single gene expression experiment. When data from multiple related gene expression studies are available, separately analyzing each study is not ideal as it may fail to detect important genes with consistent but relatively weak differential signals in multiple studies. Jointly modeling all data allows one to borrow information across studies to improve the analysis. However, a simple concordance model, in which each gene is assumed to be differential in either all studies or none of the studies, is incapable of handling genes with study-specific differential expression. In contrast, a model that naively enumerates and analyzes all possible differential patterns across studies can deal with study-specificity and allow information pooling, but the complexity of its parameter space grows exponentially as the number of studies increases. Here, we propose a correlation motif approach to address this dilemma. This approach searches for a small number of latent probability vectors called correlation motifs to capture the major correlation patterns among multiple studies. The motifs provide the basis for sharing information among studies and genes. The approach has flexibility to handle all possible study-specific differential patterns. It improves detection of differential expression and overcomes the barrier of exponential model complexity. |
format | Online Article Text |
id | pubmed-4263229 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-42632292014-12-19 Joint analysis of differential gene expression in multiple studies using correlation motifs Wei, Yingying Tenzen, Toyoaki Ji, Hongkai Biostatistics Articles The standard methods for detecting differential gene expression are mostly designed for analyzing a single gene expression experiment. When data from multiple related gene expression studies are available, separately analyzing each study is not ideal as it may fail to detect important genes with consistent but relatively weak differential signals in multiple studies. Jointly modeling all data allows one to borrow information across studies to improve the analysis. However, a simple concordance model, in which each gene is assumed to be differential in either all studies or none of the studies, is incapable of handling genes with study-specific differential expression. In contrast, a model that naively enumerates and analyzes all possible differential patterns across studies can deal with study-specificity and allow information pooling, but the complexity of its parameter space grows exponentially as the number of studies increases. Here, we propose a correlation motif approach to address this dilemma. This approach searches for a small number of latent probability vectors called correlation motifs to capture the major correlation patterns among multiple studies. The motifs provide the basis for sharing information among studies and genes. The approach has flexibility to handle all possible study-specific differential patterns. It improves detection of differential expression and overcomes the barrier of exponential model complexity. Oxford University Press 2015-01 2014-08-19 /pmc/articles/PMC4263229/ /pubmed/25143368 http://dx.doi.org/10.1093/biostatistics/kxu038 Text en © The Author 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Articles Wei, Yingying Tenzen, Toyoaki Ji, Hongkai Joint analysis of differential gene expression in multiple studies using correlation motifs |
title | Joint analysis of differential gene expression in multiple studies using correlation motifs |
title_full | Joint analysis of differential gene expression in multiple studies using correlation motifs |
title_fullStr | Joint analysis of differential gene expression in multiple studies using correlation motifs |
title_full_unstemmed | Joint analysis of differential gene expression in multiple studies using correlation motifs |
title_short | Joint analysis of differential gene expression in multiple studies using correlation motifs |
title_sort | joint analysis of differential gene expression in multiple studies using correlation motifs |
topic | Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4263229/ https://www.ncbi.nlm.nih.gov/pubmed/25143368 http://dx.doi.org/10.1093/biostatistics/kxu038 |
work_keys_str_mv | AT weiyingying jointanalysisofdifferentialgeneexpressioninmultiplestudiesusingcorrelationmotifs AT tenzentoyoaki jointanalysisofdifferentialgeneexpressioninmultiplestudiesusingcorrelationmotifs AT jihongkai jointanalysisofdifferentialgeneexpressioninmultiplestudiesusingcorrelationmotifs |