Cargando…
A novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysis
MOTIVATION: Detecting differentially expressed (DE) genes between disease and normal control group is one of the most common analyses in genome-wide transcriptomic data. Since most studies don’t have a lot of samples, researchers have used meta-analysis to group different datasets for the same disea...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5819186/ https://www.ncbi.nlm.nih.gov/pubmed/29467826 http://dx.doi.org/10.1186/s13040-018-0163-y |
_version_ | 1783301160687370240 |
---|---|
author | Qin, Wenyi Lu, Hui |
author_facet | Qin, Wenyi Lu, Hui |
author_sort | Qin, Wenyi |
collection | PubMed |
description | MOTIVATION: Detecting differentially expressed (DE) genes between disease and normal control group is one of the most common analyses in genome-wide transcriptomic data. Since most studies don’t have a lot of samples, researchers have used meta-analysis to group different datasets for the same disease. Even then, in many cases the statistical power is still not enough. Taking into account the fact that many diseases share the same disease genes, it is desirable to design a statistical framework that can identify diseases’ common and specific DE genes simultaneously to improve the identification power. RESULTS: We developed a novel empirical Bayes based mixture model to identify DE genes in specific study by leveraging the shared information across multiple different disease expression data sets. The effectiveness of joint analysis was demonstrated through comprehensive simulation studies and two real data applications. The simulation results showed that our method consistently outperformed single data set analysis and two other meta-analysis methods in identification power. In real data analysis, overall our method demonstrated better identification power in detecting DE genes and prioritized more disease related genes and disease related pathways than single data set analysis. Over 150% more disease related genes are identified by our method in application to Huntington’s disease. We expect that our method would provide researchers a new way of utilizing available data sets from different diseases when sample size of the focused disease is limited. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13040-018-0163-y) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5819186 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-58191862018-02-21 A novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysis Qin, Wenyi Lu, Hui BioData Min Methodology MOTIVATION: Detecting differentially expressed (DE) genes between disease and normal control group is one of the most common analyses in genome-wide transcriptomic data. Since most studies don’t have a lot of samples, researchers have used meta-analysis to group different datasets for the same disease. Even then, in many cases the statistical power is still not enough. Taking into account the fact that many diseases share the same disease genes, it is desirable to design a statistical framework that can identify diseases’ common and specific DE genes simultaneously to improve the identification power. RESULTS: We developed a novel empirical Bayes based mixture model to identify DE genes in specific study by leveraging the shared information across multiple different disease expression data sets. The effectiveness of joint analysis was demonstrated through comprehensive simulation studies and two real data applications. The simulation results showed that our method consistently outperformed single data set analysis and two other meta-analysis methods in identification power. In real data analysis, overall our method demonstrated better identification power in detecting DE genes and prioritized more disease related genes and disease related pathways than single data set analysis. Over 150% more disease related genes are identified by our method in application to Huntington’s disease. We expect that our method would provide researchers a new way of utilizing available data sets from different diseases when sample size of the focused disease is limited. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13040-018-0163-y) contains supplementary material, which is available to authorized users. BioMed Central 2018-02-20 /pmc/articles/PMC5819186/ /pubmed/29467826 http://dx.doi.org/10.1186/s13040-018-0163-y Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Qin, Wenyi Lu, Hui A novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysis |
title | A novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysis |
title_full | A novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysis |
title_fullStr | A novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysis |
title_full_unstemmed | A novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysis |
title_short | A novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysis |
title_sort | novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysis |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5819186/ https://www.ncbi.nlm.nih.gov/pubmed/29467826 http://dx.doi.org/10.1186/s13040-018-0163-y |
work_keys_str_mv | AT qinwenyi anoveljointanalysisframeworkimprovesidentificationofdifferentiallyexpressedgenesincrossdiseasetranscriptomicanalysis AT luhui anoveljointanalysisframeworkimprovesidentificationofdifferentiallyexpressedgenesincrossdiseasetranscriptomicanalysis AT qinwenyi noveljointanalysisframeworkimprovesidentificationofdifferentiallyexpressedgenesincrossdiseasetranscriptomicanalysis AT luhui noveljointanalysisframeworkimprovesidentificationofdifferentiallyexpressedgenesincrossdiseasetranscriptomicanalysis |