
Overlapping group screening for detection of gene-gene interactions: application to gene expression profiles with survival trait

BACKGROUND: The development of a disease is a complex process that may result from joint effects of multiple genes. In this article, we propose the overlapping group screening (OGS) approach to determining active genes and gene-gene interactions incorporating prior pathway information. The OGS metho...


Bibliographic Details
Main Authors: Wang, Jie-Huei, Chen, Yi-Hau
Format: Online Article Text
Language: English
Published: BioMed Central 2018
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6150983/
https://www.ncbi.nlm.nih.gov/pubmed/30241463
http://dx.doi.org/10.1186/s12859-018-2372-2
author Wang, Jie-Huei
Chen, Yi-Hau
collection PubMed
description BACKGROUND: The development of a disease is a complex process that may result from joint effects of multiple genes. In this article, we propose the overlapping group screening (OGS) approach to determining active genes and gene-gene interactions incorporating prior pathway information. The OGS method is developed to overcome the challenges in genome-wide data analysis that the number of the genes and gene-gene interactions is far greater than the sample size, and the pathways generally overlap with one another. The OGS method is further proposed for patients’ survival prediction based on gene expression data. RESULTS: Simulation studies demonstrate that the performance of the OGS approach in identifying the true main and interaction effects is good and the survival prediction accuracy of OGS with the Lasso penalty is better than the ordinary Lasso method. In real data analysis, we identify several significant genes and/or epistasis interactions that are associated with clinical survival outcomes of diffuse large B-cell lymphoma (DLBCL) and non-small-cell lung cancer (NSCLC) by utilizing prior pathway information from the KEGG pathway and the GO biological process databases, respectively. CONCLUSIONS: The OGS approach is useful for selecting important genes and epistasis interactions in the ultra-high dimensional feature space. The prediction ability of OGS with the Lasso penalty is better than existing methods. The OGS approach is generally applicable to various types of outcome data (quantitative, qualitative, censored event time data) and regression models (e.g. linear, logistic, and Cox’s regression models). ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2372-2) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6150983
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-6150983 2018-09-26 Overlapping group screening for detection of gene-gene interactions: application to gene expression profiles with survival trait Wang, Jie-Huei Chen, Yi-Hau BMC Bioinformatics Methodology Article BioMed Central 2018-09-21 /pmc/articles/PMC6150983/ /pubmed/30241463 http://dx.doi.org/10.1186/s12859-018-2372-2 Text en © The Author(s). 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
title Overlapping group screening for detection of gene-gene interactions: application to gene expression profiles with survival trait
topic Methodology Article