Cargando…

A Novel Joint Gene Set Analysis Framework Improves Identification of Enriched Pathways in Cross Disease Transcriptomic Analysis

Motivation: Gene set enrichment analysis is a widely accepted expression analysis tool which aims at detecting coordinated expression change within a pre-defined gene sets rather than individual genes. The benefit of gene set analysis over individual differentially expressed (DE) gene analysis inclu...

Descripción completa

Detalles Bibliográficos
Autores principales: Qin, Wenyi, Wang, Xujun, Zhao, Hongyu, Lu, Hui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6473067/
https://www.ncbi.nlm.nih.gov/pubmed/31031796
http://dx.doi.org/10.3389/fgene.2019.00293
_version_ 1783412350092574720
author Qin, Wenyi
Wang, Xujun
Zhao, Hongyu
Lu, Hui
author_facet Qin, Wenyi
Wang, Xujun
Zhao, Hongyu
Lu, Hui
author_sort Qin, Wenyi
collection PubMed
description Motivation: Gene set enrichment analysis is a widely accepted expression analysis tool which aims at detecting coordinated expression change within a pre-defined gene sets rather than individual genes. The benefit of gene set analysis over individual differentially expressed (DE) gene analysis includes more reproducible and interpretable results and detecting small but consistent change among gene set which could not be detected by DE gene analysis. There have been many successful gene set analysis applications in human diseases. However, when the sample size of a disease study is small and no other public data sets of the same disease are available, it will lead to lack of power to detect pathways of importance to the disease. Results: We have developed a novel joint gene set analysis statistical framework which aims at improving the power of identifying enriched gene sets through integrating multiple similar disease data sets. Through comprehensive simulation studies, we demonstrated that our proposed frameworks obtained much better AUC scores than single data set analysis and another meta-analysis method in identification of enriched pathways. When applied to two real data sets, the proposed framework could retain the enriched gene sets identified by single data set analysis and exclusively obtained up to 200% more disease-related gene sets demonstrating the improved identification power through information shared between similar diseases. We expect that the proposed framework would enable researchers to better explore public data sets when the sample size of their study is limited.
format Online
Article
Text
id pubmed-6473067
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-64730672019-04-26 A Novel Joint Gene Set Analysis Framework Improves Identification of Enriched Pathways in Cross Disease Transcriptomic Analysis Qin, Wenyi Wang, Xujun Zhao, Hongyu Lu, Hui Front Genet Genetics Motivation: Gene set enrichment analysis is a widely accepted expression analysis tool which aims at detecting coordinated expression change within a pre-defined gene sets rather than individual genes. The benefit of gene set analysis over individual differentially expressed (DE) gene analysis includes more reproducible and interpretable results and detecting small but consistent change among gene set which could not be detected by DE gene analysis. There have been many successful gene set analysis applications in human diseases. However, when the sample size of a disease study is small and no other public data sets of the same disease are available, it will lead to lack of power to detect pathways of importance to the disease. Results: We have developed a novel joint gene set analysis statistical framework which aims at improving the power of identifying enriched gene sets through integrating multiple similar disease data sets. Through comprehensive simulation studies, we demonstrated that our proposed frameworks obtained much better AUC scores than single data set analysis and another meta-analysis method in identification of enriched pathways. When applied to two real data sets, the proposed framework could retain the enriched gene sets identified by single data set analysis and exclusively obtained up to 200% more disease-related gene sets demonstrating the improved identification power through information shared between similar diseases. We expect that the proposed framework would enable researchers to better explore public data sets when the sample size of their study is limited. Frontiers Media S.A. 2019-04-12 /pmc/articles/PMC6473067/ /pubmed/31031796 http://dx.doi.org/10.3389/fgene.2019.00293 Text en Copyright © 2019 Qin, Wang, Zhao and Lu. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Qin, Wenyi
Wang, Xujun
Zhao, Hongyu
Lu, Hui
A Novel Joint Gene Set Analysis Framework Improves Identification of Enriched Pathways in Cross Disease Transcriptomic Analysis
title A Novel Joint Gene Set Analysis Framework Improves Identification of Enriched Pathways in Cross Disease Transcriptomic Analysis
title_full A Novel Joint Gene Set Analysis Framework Improves Identification of Enriched Pathways in Cross Disease Transcriptomic Analysis
title_fullStr A Novel Joint Gene Set Analysis Framework Improves Identification of Enriched Pathways in Cross Disease Transcriptomic Analysis
title_full_unstemmed A Novel Joint Gene Set Analysis Framework Improves Identification of Enriched Pathways in Cross Disease Transcriptomic Analysis
title_short A Novel Joint Gene Set Analysis Framework Improves Identification of Enriched Pathways in Cross Disease Transcriptomic Analysis
title_sort novel joint gene set analysis framework improves identification of enriched pathways in cross disease transcriptomic analysis
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6473067/
https://www.ncbi.nlm.nih.gov/pubmed/31031796
http://dx.doi.org/10.3389/fgene.2019.00293
work_keys_str_mv AT qinwenyi anoveljointgenesetanalysisframeworkimprovesidentificationofenrichedpathwaysincrossdiseasetranscriptomicanalysis
AT wangxujun anoveljointgenesetanalysisframeworkimprovesidentificationofenrichedpathwaysincrossdiseasetranscriptomicanalysis
AT zhaohongyu anoveljointgenesetanalysisframeworkimprovesidentificationofenrichedpathwaysincrossdiseasetranscriptomicanalysis
AT luhui anoveljointgenesetanalysisframeworkimprovesidentificationofenrichedpathwaysincrossdiseasetranscriptomicanalysis
AT qinwenyi noveljointgenesetanalysisframeworkimprovesidentificationofenrichedpathwaysincrossdiseasetranscriptomicanalysis
AT wangxujun noveljointgenesetanalysisframeworkimprovesidentificationofenrichedpathwaysincrossdiseasetranscriptomicanalysis
AT zhaohongyu noveljointgenesetanalysisframeworkimprovesidentificationofenrichedpathwaysincrossdiseasetranscriptomicanalysis
AT luhui noveljointgenesetanalysisframeworkimprovesidentificationofenrichedpathwaysincrossdiseasetranscriptomicanalysis