Cargando…
Characterizing gene sets using discriminative random walks with restart on heterogeneous biological networks
Motivation: Analysis of co-expressed gene sets typically involves testing for enrichment of different annotations or ‘properties’ such as biological processes, pathways, transcription factor binding sites, etc., one property at a time. This common approach ignores any known relationships among the p...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4937193/ https://www.ncbi.nlm.nih.gov/pubmed/27153592 http://dx.doi.org/10.1093/bioinformatics/btw151 |
_version_ | 1782441664189562880 |
---|---|
author | Blatti, Charles Sinha, Saurabh |
author_facet | Blatti, Charles Sinha, Saurabh |
author_sort | Blatti, Charles |
collection | PubMed |
description | Motivation: Analysis of co-expressed gene sets typically involves testing for enrichment of different annotations or ‘properties’ such as biological processes, pathways, transcription factor binding sites, etc., one property at a time. This common approach ignores any known relationships among the properties or the genes themselves. It is believed that known biological relationships among genes and their many properties may be exploited to more accurately reveal commonalities of a gene set. Previous work has sought to achieve this by building biological networks that combine multiple types of gene–gene or gene–property relationships, and performing network analysis to identify other genes and properties most relevant to a given gene set. Most existing network-based approaches for recognizing genes or annotations relevant to a given gene set collapse information about different properties to simplify (homogenize) the networks. Results: We present a network-based method for ranking genes or properties related to a given gene set. Such related genes or properties are identified from among the nodes of a large, heterogeneous network of biological information. Our method involves a random walk with restarts, performed on an initial network with multiple node and edge types that preserve more of the original, specific property information than current methods that operate on homogeneous networks. In this first stage of our algorithm, we find the properties that are the most relevant to the given gene set and extract a subnetwork of the original network, comprising only these relevant properties. We then re-rank genes by their similarity to the given gene set, based on a second random walk with restarts, performed on the above subnetwork. We demonstrate the effectiveness of this algorithm for ranking genes related to Drosophila embryonic development and aggressive responses in the brains of social animals. Availability and Implementation: DRaWR was implemented as an R package available at veda.cs.illinois.edu/DRaWR. Contact: blatti@illinois.edu Supplementary information: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-4937193 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-49371932016-07-11 Characterizing gene sets using discriminative random walks with restart on heterogeneous biological networks Blatti, Charles Sinha, Saurabh Bioinformatics Original Papers Motivation: Analysis of co-expressed gene sets typically involves testing for enrichment of different annotations or ‘properties’ such as biological processes, pathways, transcription factor binding sites, etc., one property at a time. This common approach ignores any known relationships among the properties or the genes themselves. It is believed that known biological relationships among genes and their many properties may be exploited to more accurately reveal commonalities of a gene set. Previous work has sought to achieve this by building biological networks that combine multiple types of gene–gene or gene–property relationships, and performing network analysis to identify other genes and properties most relevant to a given gene set. Most existing network-based approaches for recognizing genes or annotations relevant to a given gene set collapse information about different properties to simplify (homogenize) the networks. Results: We present a network-based method for ranking genes or properties related to a given gene set. Such related genes or properties are identified from among the nodes of a large, heterogeneous network of biological information. Our method involves a random walk with restarts, performed on an initial network with multiple node and edge types that preserve more of the original, specific property information than current methods that operate on homogeneous networks. In this first stage of our algorithm, we find the properties that are the most relevant to the given gene set and extract a subnetwork of the original network, comprising only these relevant properties. We then re-rank genes by their similarity to the given gene set, based on a second random walk with restarts, performed on the above subnetwork. We demonstrate the effectiveness of this algorithm for ranking genes related to Drosophila embryonic development and aggressive responses in the brains of social animals. Availability and Implementation: DRaWR was implemented as an R package available at veda.cs.illinois.edu/DRaWR. Contact: blatti@illinois.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2016-07-15 2016-03-19 /pmc/articles/PMC4937193/ /pubmed/27153592 http://dx.doi.org/10.1093/bioinformatics/btw151 Text en © The Author 2016. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Original Papers Blatti, Charles Sinha, Saurabh Characterizing gene sets using discriminative random walks with restart on heterogeneous biological networks |
title | Characterizing gene sets using discriminative random walks with restart on heterogeneous biological networks |
title_full | Characterizing gene sets using discriminative random walks with restart on heterogeneous biological networks |
title_fullStr | Characterizing gene sets using discriminative random walks with restart on heterogeneous biological networks |
title_full_unstemmed | Characterizing gene sets using discriminative random walks with restart on heterogeneous biological networks |
title_short | Characterizing gene sets using discriminative random walks with restart on heterogeneous biological networks |
title_sort | characterizing gene sets using discriminative random walks with restart on heterogeneous biological networks |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4937193/ https://www.ncbi.nlm.nih.gov/pubmed/27153592 http://dx.doi.org/10.1093/bioinformatics/btw151 |
work_keys_str_mv | AT blatticharles characterizinggenesetsusingdiscriminativerandomwalkswithrestartonheterogeneousbiologicalnetworks AT sinhasaurabh characterizinggenesetsusingdiscriminativerandomwalkswithrestartonheterogeneousbiologicalnetworks |