Cargando…

Identifying and exploiting gene-pathway interactions from RNA-seq data for binary phenotype

BACKGROUND: RNA sequencing (RNA-seq) technology has identified multiple differentially expressed (DE) genes associated to complex disease, however, these genes only explain a modest part of variance. Omnigenic model assumes that disease may be driven by genes with indirect relevance to disease and b...

Descripción completa

Detalles Bibliográficos
Autores principales: Shao, Fang, Wang, Yaqi, Zhao, Yang, Yang, Sheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6423879/
https://www.ncbi.nlm.nih.gov/pubmed/30890140
http://dx.doi.org/10.1186/s12863-019-0739-7
_version_ 1783404607149441024
author Shao, Fang
Wang, Yaqi
Zhao, Yang
Yang, Sheng
author_facet Shao, Fang
Wang, Yaqi
Zhao, Yang
Yang, Sheng
author_sort Shao, Fang
collection PubMed
description BACKGROUND: RNA sequencing (RNA-seq) technology has identified multiple differentially expressed (DE) genes associated to complex disease, however, these genes only explain a modest part of variance. Omnigenic model assumes that disease may be driven by genes with indirect relevance to disease and be propagated by functional pathways. Here, we focus on identifying the interactions between the external genes and functional pathways, referring to gene-pathway interactions (GPIs). Specifically, relying on the relationship between the garrote kernel machine (GKM) and variance component test and permutations for the empirical distributions of score statistics, we propose an efficient analysis procedure as Permutation based gEne-pAthway interaction identification in binary phenotype (PEA). RESULTS: Various simulations show that PEA has well-calibrated type I error rates and higher power than the traditional likelihood ratio test (LRT). In addition, we perform the gene set enrichment algorithms and PEA to identifying the GPIs from a pan-cancer data (GES68086). These GPIs and genes possibly further illustrate the potential etiology of cancers, most of which are identified and some external genes and significant pathways are consistent with previous studies. CONCLUSIONS: PEA is an efficient tool for identifying the GPIs from RNA-seq data. It can be further extended to identify the interactions between one variable and one functional set of other omics data for binary phenotypes. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12863-019-0739-7) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6423879
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-64238792019-03-28 Identifying and exploiting gene-pathway interactions from RNA-seq data for binary phenotype Shao, Fang Wang, Yaqi Zhao, Yang Yang, Sheng BMC Genet Methodology Article BACKGROUND: RNA sequencing (RNA-seq) technology has identified multiple differentially expressed (DE) genes associated to complex disease, however, these genes only explain a modest part of variance. Omnigenic model assumes that disease may be driven by genes with indirect relevance to disease and be propagated by functional pathways. Here, we focus on identifying the interactions between the external genes and functional pathways, referring to gene-pathway interactions (GPIs). Specifically, relying on the relationship between the garrote kernel machine (GKM) and variance component test and permutations for the empirical distributions of score statistics, we propose an efficient analysis procedure as Permutation based gEne-pAthway interaction identification in binary phenotype (PEA). RESULTS: Various simulations show that PEA has well-calibrated type I error rates and higher power than the traditional likelihood ratio test (LRT). In addition, we perform the gene set enrichment algorithms and PEA to identifying the GPIs from a pan-cancer data (GES68086). These GPIs and genes possibly further illustrate the potential etiology of cancers, most of which are identified and some external genes and significant pathways are consistent with previous studies. CONCLUSIONS: PEA is an efficient tool for identifying the GPIs from RNA-seq data. It can be further extended to identify the interactions between one variable and one functional set of other omics data for binary phenotypes. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12863-019-0739-7) contains supplementary material, which is available to authorized users. BioMed Central 2019-03-19 /pmc/articles/PMC6423879/ /pubmed/30890140 http://dx.doi.org/10.1186/s12863-019-0739-7 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Shao, Fang
Wang, Yaqi
Zhao, Yang
Yang, Sheng
Identifying and exploiting gene-pathway interactions from RNA-seq data for binary phenotype
title Identifying and exploiting gene-pathway interactions from RNA-seq data for binary phenotype
title_full Identifying and exploiting gene-pathway interactions from RNA-seq data for binary phenotype
title_fullStr Identifying and exploiting gene-pathway interactions from RNA-seq data for binary phenotype
title_full_unstemmed Identifying and exploiting gene-pathway interactions from RNA-seq data for binary phenotype
title_short Identifying and exploiting gene-pathway interactions from RNA-seq data for binary phenotype
title_sort identifying and exploiting gene-pathway interactions from rna-seq data for binary phenotype
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6423879/
https://www.ncbi.nlm.nih.gov/pubmed/30890140
http://dx.doi.org/10.1186/s12863-019-0739-7
work_keys_str_mv AT shaofang identifyingandexploitinggenepathwayinteractionsfromrnaseqdataforbinaryphenotype
AT wangyaqi identifyingandexploitinggenepathwayinteractionsfromrnaseqdataforbinaryphenotype
AT zhaoyang identifyingandexploitinggenepathwayinteractionsfromrnaseqdataforbinaryphenotype
AT yangsheng identifyingandexploitinggenepathwayinteractionsfromrnaseqdataforbinaryphenotype