Cargando…

Probabilistic inference and ranking of gene regulatory pathways as a shortest-path problem

BACKGROUND: Since the advent of microarray technology, numerous methods have been devised to infer gene regulatory relationships from gene expression data. Many approaches that infer entire regulatory networks. This produces results that are rich in information and yet so complex that they are often...

Descripción completa

Detalles Bibliográficos
Autores principales: Jensen, James D, Jensen, Daniel M, Clement, Mark J, Snell, Quinn O
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3849606/
https://www.ncbi.nlm.nih.gov/pubmed/24266986
http://dx.doi.org/10.1186/1471-2105-14-S13-S5
_version_ 1782293960270544896
author Jensen, James D
Jensen, Daniel M
Clement, Mark J
Snell, Quinn O
author_facet Jensen, James D
Jensen, Daniel M
Clement, Mark J
Snell, Quinn O
author_sort Jensen, James D
collection PubMed
description BACKGROUND: Since the advent of microarray technology, numerous methods have been devised to infer gene regulatory relationships from gene expression data. Many approaches that infer entire regulatory networks. This produces results that are rich in information and yet so complex that they are often of limited usefulness for researchers. One alternative unit of regulatory interactions is a linear path between genes. Linear paths are more comprehensible than networks and still contain important information. Such paths can be extracted from inferred regulatory networks or inferred directly. Since criteria for inferring networks generally differs from criteria for inferring paths, indirect and direct inference of paths may achieve different results. RESULTS: This paper explores a strategy to infer linear pathways by converting the path inference problem into a shortest-path problem. The edge weights used are the negative log-transformed probabilities of directness derived from the posterior joint distributions of pairwise mutual information between gene expression levels. Directness is inferred using the data processing inequality. The method was designed with two goals. One is to achieve better accuracy in path inference than extraction of paths from inferred networks. The other is to facilitate priorization of interactions for laboratory validation. A method is proposed for achieving this by ranking paths according to the joint probability of directness of each path's edges. The algorithm is evaluated using simulated expression data and is compared to extraction of shortest paths from networks inferred by two alternative methods, ARACNe and a minimum spanning tree algorithm. CONCLUSIONS: Direct path inference appears to achieve accuracy competitive with that obtained by extracting paths from networks inferred by the other methods. Preliminary exploration of the use of joint edge probabilities to rank paths is largely inconclusive. Suggestions for a better framework for such comparisons are discussed.
format Online
Article
Text
id pubmed-3849606
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-38496062013-12-06 Probabilistic inference and ranking of gene regulatory pathways as a shortest-path problem Jensen, James D Jensen, Daniel M Clement, Mark J Snell, Quinn O BMC Bioinformatics Research BACKGROUND: Since the advent of microarray technology, numerous methods have been devised to infer gene regulatory relationships from gene expression data. Many approaches that infer entire regulatory networks. This produces results that are rich in information and yet so complex that they are often of limited usefulness for researchers. One alternative unit of regulatory interactions is a linear path between genes. Linear paths are more comprehensible than networks and still contain important information. Such paths can be extracted from inferred regulatory networks or inferred directly. Since criteria for inferring networks generally differs from criteria for inferring paths, indirect and direct inference of paths may achieve different results. RESULTS: This paper explores a strategy to infer linear pathways by converting the path inference problem into a shortest-path problem. The edge weights used are the negative log-transformed probabilities of directness derived from the posterior joint distributions of pairwise mutual information between gene expression levels. Directness is inferred using the data processing inequality. The method was designed with two goals. One is to achieve better accuracy in path inference than extraction of paths from inferred networks. The other is to facilitate priorization of interactions for laboratory validation. A method is proposed for achieving this by ranking paths according to the joint probability of directness of each path's edges. The algorithm is evaluated using simulated expression data and is compared to extraction of shortest paths from networks inferred by two alternative methods, ARACNe and a minimum spanning tree algorithm. CONCLUSIONS: Direct path inference appears to achieve accuracy competitive with that obtained by extracting paths from networks inferred by the other methods. Preliminary exploration of the use of joint edge probabilities to rank paths is largely inconclusive. Suggestions for a better framework for such comparisons are discussed. BioMed Central 2013-10-01 /pmc/articles/PMC3849606/ /pubmed/24266986 http://dx.doi.org/10.1186/1471-2105-14-S13-S5 Text en Copyright © 2013 Jensen et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Jensen, James D
Jensen, Daniel M
Clement, Mark J
Snell, Quinn O
Probabilistic inference and ranking of gene regulatory pathways as a shortest-path problem
title Probabilistic inference and ranking of gene regulatory pathways as a shortest-path problem
title_full Probabilistic inference and ranking of gene regulatory pathways as a shortest-path problem
title_fullStr Probabilistic inference and ranking of gene regulatory pathways as a shortest-path problem
title_full_unstemmed Probabilistic inference and ranking of gene regulatory pathways as a shortest-path problem
title_short Probabilistic inference and ranking of gene regulatory pathways as a shortest-path problem
title_sort probabilistic inference and ranking of gene regulatory pathways as a shortest-path problem
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3849606/
https://www.ncbi.nlm.nih.gov/pubmed/24266986
http://dx.doi.org/10.1186/1471-2105-14-S13-S5
work_keys_str_mv AT jensenjamesd probabilisticinferenceandrankingofgeneregulatorypathwaysasashortestpathproblem
AT jensendanielm probabilisticinferenceandrankingofgeneregulatorypathwaysasashortestpathproblem
AT clementmarkj probabilisticinferenceandrankingofgeneregulatorypathwaysasashortestpathproblem
AT snellquinno probabilisticinferenceandrankingofgeneregulatorypathwaysasashortestpathproblem