Cargando…

Characterizing gene sets using discriminative random walks with restart on heterogeneous biological networks

Motivation: Analysis of co-expressed gene sets typically involves testing for enrichment of different annotations or ‘properties’ such as biological processes, pathways, transcription factor binding sites, etc., one property at a time. This common approach ignores any known relationships among the p...

Descripción completa

Detalles Bibliográficos
Autores principales:	Blatti, Charles, Sinha, Saurabh
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2016
Materias:	Original Papers
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4937193/ https://www.ncbi.nlm.nih.gov/pubmed/27153592 http://dx.doi.org/10.1093/bioinformatics/btw151

_version_	1782441664189562880
author	Blatti, Charles Sinha, Saurabh
author_facet	Blatti, Charles Sinha, Saurabh
author_sort	Blatti, Charles
collection	PubMed
description	Motivation: Analysis of co-expressed gene sets typically involves testing for enrichment of different annotations or ‘properties’ such as biological processes, pathways, transcription factor binding sites, etc., one property at a time. This common approach ignores any known relationships among the properties or the genes themselves. It is believed that known biological relationships among genes and their many properties may be exploited to more accurately reveal commonalities of a gene set. Previous work has sought to achieve this by building biological networks that combine multiple types of gene–gene or gene–property relationships, and performing network analysis to identify other genes and properties most relevant to a given gene set. Most existing network-based approaches for recognizing genes or annotations relevant to a given gene set collapse information about different properties to simplify (homogenize) the networks. Results: We present a network-based method for ranking genes or properties related to a given gene set. Such related genes or properties are identified from among the nodes of a large, heterogeneous network of biological information. Our method involves a random walk with restarts, performed on an initial network with multiple node and edge types that preserve more of the original, specific property information than current methods that operate on homogeneous networks. In this first stage of our algorithm, we find the properties that are the most relevant to the given gene set and extract a subnetwork of the original network, comprising only these relevant properties. We then re-rank genes by their similarity to the given gene set, based on a second random walk with restarts, performed on the above subnetwork. We demonstrate the effectiveness of this algorithm for ranking genes related to Drosophila embryonic development and aggressive responses in the brains of social animals. Availability and Implementation: DRaWR was implemented as an R package available at veda.cs.illinois.edu/DRaWR. Contact: blatti@illinois.edu Supplementary information: Supplementary data are available at Bioinformatics online.
format	Online Article Text
id	pubmed-4937193
institution	National Center for Biotechnology Information
language	English
publishDate	2016
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-49371932016-07-11 Characterizing gene sets using discriminative random walks with restart on heterogeneous biological networks Blatti, Charles Sinha, Saurabh Bioinformatics Original Papers Motivation: Analysis of co-expressed gene sets typically involves testing for enrichment of different annotations or ‘properties’ such as biological processes, pathways, transcription factor binding sites, etc., one property at a time. This common approach ignores any known relationships among the properties or the genes themselves. It is believed that known biological relationships among genes and their many properties may be exploited to more accurately reveal commonalities of a gene set. Previous work has sought to achieve this by building biological networks that combine multiple types of gene–gene or gene–property relationships, and performing network analysis to identify other genes and properties most relevant to a given gene set. Most existing network-based approaches for recognizing genes or annotations relevant to a given gene set collapse information about different properties to simplify (homogenize) the networks. Results: We present a network-based method for ranking genes or properties related to a given gene set. Such related genes or properties are identified from among the nodes of a large, heterogeneous network of biological information. Our method involves a random walk with restarts, performed on an initial network with multiple node and edge types that preserve more of the original, specific property information than current methods that operate on homogeneous networks. In this first stage of our algorithm, we find the properties that are the most relevant to the given gene set and extract a subnetwork of the original network, comprising only these relevant properties. We then re-rank genes by their similarity to the given gene set, based on a second random walk with restarts, performed on the above subnetwork. We demonstrate the effectiveness of this algorithm for ranking genes related to Drosophila embryonic development and aggressive responses in the brains of social animals. Availability and Implementation: DRaWR was implemented as an R package available at veda.cs.illinois.edu/DRaWR. Contact: blatti@illinois.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2016-07-15 2016-03-19 /pmc/articles/PMC4937193/ /pubmed/27153592 http://dx.doi.org/10.1093/bioinformatics/btw151 Text en © The Author 2016. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle	Original Papers Blatti, Charles Sinha, Saurabh Characterizing gene sets using discriminative random walks with restart on heterogeneous biological networks
title	Characterizing gene sets using discriminative random walks with restart on heterogeneous biological networks
title_full	Characterizing gene sets using discriminative random walks with restart on heterogeneous biological networks
title_fullStr	Characterizing gene sets using discriminative random walks with restart on heterogeneous biological networks
title_full_unstemmed	Characterizing gene sets using discriminative random walks with restart on heterogeneous biological networks
title_short	Characterizing gene sets using discriminative random walks with restart on heterogeneous biological networks
title_sort	characterizing gene sets using discriminative random walks with restart on heterogeneous biological networks
topic	Original Papers
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4937193/ https://www.ncbi.nlm.nih.gov/pubmed/27153592 http://dx.doi.org/10.1093/bioinformatics/btw151
work_keys_str_mv	AT blatticharles characterizinggenesetsusingdiscriminativerandomwalkswithrestartonheterogeneousbiologicalnetworks AT sinhasaurabh characterizinggenesetsusingdiscriminativerandomwalkswithrestartonheterogeneousbiologicalnetworks

Characterizing gene sets using discriminative random walks with restart on heterogeneous biological networks

Ejemplares similares