Cargando…

Characterizing gene sets using discriminative random walks with restart on heterogeneous biological networks

Motivation: Analysis of co-expressed gene sets typically involves testing for enrichment of different annotations or ‘properties’ such as biological processes, pathways, transcription factor binding sites, etc., one property at a time. This common approach ignores any known relationships among the p...

Descripción completa

Detalles Bibliográficos
Autores principales: Blatti, Charles, Sinha, Saurabh
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4937193/
https://www.ncbi.nlm.nih.gov/pubmed/27153592
http://dx.doi.org/10.1093/bioinformatics/btw151
_version_ 1782441664189562880
author Blatti, Charles
Sinha, Saurabh
author_facet Blatti, Charles
Sinha, Saurabh
author_sort Blatti, Charles
collection PubMed
description Motivation: Analysis of co-expressed gene sets typically involves testing for enrichment of different annotations or ‘properties’ such as biological processes, pathways, transcription factor binding sites, etc., one property at a time. This common approach ignores any known relationships among the properties or the genes themselves. It is believed that known biological relationships among genes and their many properties may be exploited to more accurately reveal commonalities of a gene set. Previous work has sought to achieve this by building biological networks that combine multiple types of gene–gene or gene–property relationships, and performing network analysis to identify other genes and properties most relevant to a given gene set. Most existing network-based approaches for recognizing genes or annotations relevant to a given gene set collapse information about different properties to simplify (homogenize) the networks. Results: We present a network-based method for ranking genes or properties related to a given gene set. Such related genes or properties are identified from among the nodes of a large, heterogeneous network of biological information. Our method involves a random walk with restarts, performed on an initial network with multiple node and edge types that preserve more of the original, specific property information than current methods that operate on homogeneous networks. In this first stage of our algorithm, we find the properties that are the most relevant to the given gene set and extract a subnetwork of the original network, comprising only these relevant properties. We then re-rank genes by their similarity to the given gene set, based on a second random walk with restarts, performed on the above subnetwork. We demonstrate the effectiveness of this algorithm for ranking genes related to Drosophila embryonic development and aggressive responses in the brains of social animals. Availability and Implementation: DRaWR was implemented as an R package available at veda.cs.illinois.edu/DRaWR. Contact: blatti@illinois.edu Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-4937193
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-49371932016-07-11 Characterizing gene sets using discriminative random walks with restart on heterogeneous biological networks Blatti, Charles Sinha, Saurabh Bioinformatics Original Papers Motivation: Analysis of co-expressed gene sets typically involves testing for enrichment of different annotations or ‘properties’ such as biological processes, pathways, transcription factor binding sites, etc., one property at a time. This common approach ignores any known relationships among the properties or the genes themselves. It is believed that known biological relationships among genes and their many properties may be exploited to more accurately reveal commonalities of a gene set. Previous work has sought to achieve this by building biological networks that combine multiple types of gene–gene or gene–property relationships, and performing network analysis to identify other genes and properties most relevant to a given gene set. Most existing network-based approaches for recognizing genes or annotations relevant to a given gene set collapse information about different properties to simplify (homogenize) the networks. Results: We present a network-based method for ranking genes or properties related to a given gene set. Such related genes or properties are identified from among the nodes of a large, heterogeneous network of biological information. Our method involves a random walk with restarts, performed on an initial network with multiple node and edge types that preserve more of the original, specific property information than current methods that operate on homogeneous networks. In this first stage of our algorithm, we find the properties that are the most relevant to the given gene set and extract a subnetwork of the original network, comprising only these relevant properties. We then re-rank genes by their similarity to the given gene set, based on a second random walk with restarts, performed on the above subnetwork. We demonstrate the effectiveness of this algorithm for ranking genes related to Drosophila embryonic development and aggressive responses in the brains of social animals. Availability and Implementation: DRaWR was implemented as an R package available at veda.cs.illinois.edu/DRaWR. Contact: blatti@illinois.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2016-07-15 2016-03-19 /pmc/articles/PMC4937193/ /pubmed/27153592 http://dx.doi.org/10.1093/bioinformatics/btw151 Text en © The Author 2016. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Blatti, Charles
Sinha, Saurabh
Characterizing gene sets using discriminative random walks with restart on heterogeneous biological networks
title Characterizing gene sets using discriminative random walks with restart on heterogeneous biological networks
title_full Characterizing gene sets using discriminative random walks with restart on heterogeneous biological networks
title_fullStr Characterizing gene sets using discriminative random walks with restart on heterogeneous biological networks
title_full_unstemmed Characterizing gene sets using discriminative random walks with restart on heterogeneous biological networks
title_short Characterizing gene sets using discriminative random walks with restart on heterogeneous biological networks
title_sort characterizing gene sets using discriminative random walks with restart on heterogeneous biological networks
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4937193/
https://www.ncbi.nlm.nih.gov/pubmed/27153592
http://dx.doi.org/10.1093/bioinformatics/btw151
work_keys_str_mv AT blatticharles characterizinggenesetsusingdiscriminativerandomwalkswithrestartonheterogeneousbiologicalnetworks
AT sinhasaurabh characterizinggenesetsusingdiscriminativerandomwalkswithrestartonheterogeneousbiologicalnetworks