Cargando…

A novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysis

MOTIVATION: Detecting differentially expressed (DE) genes between disease and normal control group is one of the most common analyses in genome-wide transcriptomic data. Since most studies don’t have a lot of samples, researchers have used meta-analysis to group different datasets for the same disea...

Descripción completa

Detalles Bibliográficos
Autores principales: Qin, Wenyi, Lu, Hui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5819186/
https://www.ncbi.nlm.nih.gov/pubmed/29467826
http://dx.doi.org/10.1186/s13040-018-0163-y
_version_ 1783301160687370240
author Qin, Wenyi
Lu, Hui
author_facet Qin, Wenyi
Lu, Hui
author_sort Qin, Wenyi
collection PubMed
description MOTIVATION: Detecting differentially expressed (DE) genes between disease and normal control group is one of the most common analyses in genome-wide transcriptomic data. Since most studies don’t have a lot of samples, researchers have used meta-analysis to group different datasets for the same disease. Even then, in many cases the statistical power is still not enough. Taking into account the fact that many diseases share the same disease genes, it is desirable to design a statistical framework that can identify diseases’ common and specific DE genes simultaneously to improve the identification power. RESULTS: We developed a novel empirical Bayes based mixture model to identify DE genes in specific study by leveraging the shared information across multiple different disease expression data sets. The effectiveness of joint analysis was demonstrated through comprehensive simulation studies and two real data applications. The simulation results showed that our method consistently outperformed single data set analysis and two other meta-analysis methods in identification power. In real data analysis, overall our method demonstrated better identification power in detecting DE genes and prioritized more disease related genes and disease related pathways than single data set analysis. Over 150% more disease related genes are identified by our method in application to Huntington’s disease. We expect that our method would provide researchers a new way of utilizing available data sets from different diseases when sample size of the focused disease is limited. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13040-018-0163-y) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5819186
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-58191862018-02-21 A novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysis Qin, Wenyi Lu, Hui BioData Min Methodology MOTIVATION: Detecting differentially expressed (DE) genes between disease and normal control group is one of the most common analyses in genome-wide transcriptomic data. Since most studies don’t have a lot of samples, researchers have used meta-analysis to group different datasets for the same disease. Even then, in many cases the statistical power is still not enough. Taking into account the fact that many diseases share the same disease genes, it is desirable to design a statistical framework that can identify diseases’ common and specific DE genes simultaneously to improve the identification power. RESULTS: We developed a novel empirical Bayes based mixture model to identify DE genes in specific study by leveraging the shared information across multiple different disease expression data sets. The effectiveness of joint analysis was demonstrated through comprehensive simulation studies and two real data applications. The simulation results showed that our method consistently outperformed single data set analysis and two other meta-analysis methods in identification power. In real data analysis, overall our method demonstrated better identification power in detecting DE genes and prioritized more disease related genes and disease related pathways than single data set analysis. Over 150% more disease related genes are identified by our method in application to Huntington’s disease. We expect that our method would provide researchers a new way of utilizing available data sets from different diseases when sample size of the focused disease is limited. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13040-018-0163-y) contains supplementary material, which is available to authorized users. BioMed Central 2018-02-20 /pmc/articles/PMC5819186/ /pubmed/29467826 http://dx.doi.org/10.1186/s13040-018-0163-y Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology
Qin, Wenyi
Lu, Hui
A novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysis
title A novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysis
title_full A novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysis
title_fullStr A novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysis
title_full_unstemmed A novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysis
title_short A novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysis
title_sort novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysis
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5819186/
https://www.ncbi.nlm.nih.gov/pubmed/29467826
http://dx.doi.org/10.1186/s13040-018-0163-y
work_keys_str_mv AT qinwenyi anoveljointanalysisframeworkimprovesidentificationofdifferentiallyexpressedgenesincrossdiseasetranscriptomicanalysis
AT luhui anoveljointanalysisframeworkimprovesidentificationofdifferentiallyexpressedgenesincrossdiseasetranscriptomicanalysis
AT qinwenyi noveljointanalysisframeworkimprovesidentificationofdifferentiallyexpressedgenesincrossdiseasetranscriptomicanalysis
AT luhui noveljointanalysisframeworkimprovesidentificationofdifferentiallyexpressedgenesincrossdiseasetranscriptomicanalysis