Cargando…

Mining pathway associations for disease-related pathway activity analysis based on gene expression and methylation data

BACKGROUND: The problem of discovering genetic markers as disease signatures is of great significance for the successful diagnosis, treatment, and prognosis of complex diseases. Even if many earlier studies worked on identifying disease markers from a variety of biological resources, they mostly foc...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Hyeonjeong, Shin, Miyoung
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5286825/
https://www.ncbi.nlm.nih.gov/pubmed/28168005
http://dx.doi.org/10.1186/s13040-017-0127-7
_version_ 1782504071923499008
author Lee, Hyeonjeong
Shin, Miyoung
author_facet Lee, Hyeonjeong
Shin, Miyoung
author_sort Lee, Hyeonjeong
collection PubMed
description BACKGROUND: The problem of discovering genetic markers as disease signatures is of great significance for the successful diagnosis, treatment, and prognosis of complex diseases. Even if many earlier studies worked on identifying disease markers from a variety of biological resources, they mostly focused on the markers of genes or gene-sets (i.e., pathways). However, these markers may not be enough to explain biological interactions between genetic variables that are related to diseases. Thus, in this study, our aim is to investigate distinctive associations among active pathways (i.e., pathway-sets) shown each in case and control samples which can be observed from gene expression and/or methylation data. RESULTS: The pathway-sets are obtained by identifying a set of associated pathways that are often active together over a significant number of class samples. For this purpose, gene expression or methylation profiles are first analyzed to identify significant (active) pathways via gene-set enrichment analysis. Then, regarding these active pathways, an association rule mining approach is applied to examine interesting pathway-sets in each class of samples (case or control). By doing so, the sets of associated pathways often working together in activity profiles are finally chosen as our distinctive signature of each class. The identified pathway-sets are aggregated into a pathway activity network (PAN), which facilitates the visualization of differential pathway associations between case and control samples. From our experiments with two publicly available datasets, we could find interesting PAN structures as the distinctive signatures of breast cancer and uterine leiomyoma cancer, respectively. CONCLUSIONS: Our pathway-set markers were shown to be superior or very comparable to other genetic markers (such as genes or gene-sets) in disease classification. Furthermore, the PAN structure, which can be constructed from the identified markers of pathway-sets, could provide deeper insights into distinctive associations between pathway activities in case and control samples. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13040-017-0127-7) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5286825
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-52868252017-02-06 Mining pathway associations for disease-related pathway activity analysis based on gene expression and methylation data Lee, Hyeonjeong Shin, Miyoung BioData Min Methodology BACKGROUND: The problem of discovering genetic markers as disease signatures is of great significance for the successful diagnosis, treatment, and prognosis of complex diseases. Even if many earlier studies worked on identifying disease markers from a variety of biological resources, they mostly focused on the markers of genes or gene-sets (i.e., pathways). However, these markers may not be enough to explain biological interactions between genetic variables that are related to diseases. Thus, in this study, our aim is to investigate distinctive associations among active pathways (i.e., pathway-sets) shown each in case and control samples which can be observed from gene expression and/or methylation data. RESULTS: The pathway-sets are obtained by identifying a set of associated pathways that are often active together over a significant number of class samples. For this purpose, gene expression or methylation profiles are first analyzed to identify significant (active) pathways via gene-set enrichment analysis. Then, regarding these active pathways, an association rule mining approach is applied to examine interesting pathway-sets in each class of samples (case or control). By doing so, the sets of associated pathways often working together in activity profiles are finally chosen as our distinctive signature of each class. The identified pathway-sets are aggregated into a pathway activity network (PAN), which facilitates the visualization of differential pathway associations between case and control samples. From our experiments with two publicly available datasets, we could find interesting PAN structures as the distinctive signatures of breast cancer and uterine leiomyoma cancer, respectively. CONCLUSIONS: Our pathway-set markers were shown to be superior or very comparable to other genetic markers (such as genes or gene-sets) in disease classification. Furthermore, the PAN structure, which can be constructed from the identified markers of pathway-sets, could provide deeper insights into distinctive associations between pathway activities in case and control samples. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13040-017-0127-7) contains supplementary material, which is available to authorized users. BioMed Central 2017-02-01 /pmc/articles/PMC5286825/ /pubmed/28168005 http://dx.doi.org/10.1186/s13040-017-0127-7 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology
Lee, Hyeonjeong
Shin, Miyoung
Mining pathway associations for disease-related pathway activity analysis based on gene expression and methylation data
title Mining pathway associations for disease-related pathway activity analysis based on gene expression and methylation data
title_full Mining pathway associations for disease-related pathway activity analysis based on gene expression and methylation data
title_fullStr Mining pathway associations for disease-related pathway activity analysis based on gene expression and methylation data
title_full_unstemmed Mining pathway associations for disease-related pathway activity analysis based on gene expression and methylation data
title_short Mining pathway associations for disease-related pathway activity analysis based on gene expression and methylation data
title_sort mining pathway associations for disease-related pathway activity analysis based on gene expression and methylation data
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5286825/
https://www.ncbi.nlm.nih.gov/pubmed/28168005
http://dx.doi.org/10.1186/s13040-017-0127-7
work_keys_str_mv AT leehyeonjeong miningpathwayassociationsfordiseaserelatedpathwayactivityanalysisbasedongeneexpressionandmethylationdata
AT shinmiyoung miningpathwayassociationsfordiseaserelatedpathwayactivityanalysisbasedongeneexpressionandmethylationdata