Cargando…

Incorporating networks in a probabilistic graphical model to find drivers for complex human diseases

Discovering genetic mechanisms driving complex diseases is a hard problem. Existing methods often lack power to identify the set of responsible genes. Protein-protein interaction networks have been shown to boost power when detecting gene-disease associations. We introduce a Bayesian framework, Conf...

Descripción completa

Detalles Bibliográficos
Autores principales: Mezlini, Aziz M., Goldenberg, Anna
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5638204/
https://www.ncbi.nlm.nih.gov/pubmed/29023450
http://dx.doi.org/10.1371/journal.pcbi.1005580
_version_ 1783270705602756608
author Mezlini, Aziz M.
Goldenberg, Anna
author_facet Mezlini, Aziz M.
Goldenberg, Anna
author_sort Mezlini, Aziz M.
collection PubMed
description Discovering genetic mechanisms driving complex diseases is a hard problem. Existing methods often lack power to identify the set of responsible genes. Protein-protein interaction networks have been shown to boost power when detecting gene-disease associations. We introduce a Bayesian framework, Conflux, to find disease associated genes from exome sequencing data using networks as a prior. There are two main advantages to using networks within a probabilistic graphical model. First, networks are noisy and incomplete, a substantial impediment to gene discovery. Incorporating networks into the structure of a probabilistic models for gene inference has less impact on the solution than relying on the noisy network structure directly. Second, using a Bayesian framework we can keep track of the uncertainty of each gene being associated with the phenotype rather than returning a fixed list of genes. We first show that using networks clearly improves gene detection compared to individual gene testing. We then show consistently improved performance of Conflux compared to the state-of-the-art diffusion network-based method Hotnet2 and a variety of other network and variant aggregation methods, using randomly generated and literature-reported gene sets. We test Hotnet2 and Conflux on several network configurations to reveal biases and patterns of false positives and false negatives in each case. Our experiments show that our novel Bayesian framework Conflux incorporates many of the advantages of the current state-of-the-art methods, while offering more flexibility and improved power in many gene-disease association scenarios.
format Online
Article
Text
id pubmed-5638204
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-56382042017-11-03 Incorporating networks in a probabilistic graphical model to find drivers for complex human diseases Mezlini, Aziz M. Goldenberg, Anna PLoS Comput Biol Research Article Discovering genetic mechanisms driving complex diseases is a hard problem. Existing methods often lack power to identify the set of responsible genes. Protein-protein interaction networks have been shown to boost power when detecting gene-disease associations. We introduce a Bayesian framework, Conflux, to find disease associated genes from exome sequencing data using networks as a prior. There are two main advantages to using networks within a probabilistic graphical model. First, networks are noisy and incomplete, a substantial impediment to gene discovery. Incorporating networks into the structure of a probabilistic models for gene inference has less impact on the solution than relying on the noisy network structure directly. Second, using a Bayesian framework we can keep track of the uncertainty of each gene being associated with the phenotype rather than returning a fixed list of genes. We first show that using networks clearly improves gene detection compared to individual gene testing. We then show consistently improved performance of Conflux compared to the state-of-the-art diffusion network-based method Hotnet2 and a variety of other network and variant aggregation methods, using randomly generated and literature-reported gene sets. We test Hotnet2 and Conflux on several network configurations to reveal biases and patterns of false positives and false negatives in each case. Our experiments show that our novel Bayesian framework Conflux incorporates many of the advantages of the current state-of-the-art methods, while offering more flexibility and improved power in many gene-disease association scenarios. Public Library of Science 2017-10-12 /pmc/articles/PMC5638204/ /pubmed/29023450 http://dx.doi.org/10.1371/journal.pcbi.1005580 Text en © 2017 Mezlini, Goldenberg http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Mezlini, Aziz M.
Goldenberg, Anna
Incorporating networks in a probabilistic graphical model to find drivers for complex human diseases
title Incorporating networks in a probabilistic graphical model to find drivers for complex human diseases
title_full Incorporating networks in a probabilistic graphical model to find drivers for complex human diseases
title_fullStr Incorporating networks in a probabilistic graphical model to find drivers for complex human diseases
title_full_unstemmed Incorporating networks in a probabilistic graphical model to find drivers for complex human diseases
title_short Incorporating networks in a probabilistic graphical model to find drivers for complex human diseases
title_sort incorporating networks in a probabilistic graphical model to find drivers for complex human diseases
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5638204/
https://www.ncbi.nlm.nih.gov/pubmed/29023450
http://dx.doi.org/10.1371/journal.pcbi.1005580
work_keys_str_mv AT mezliniazizm incorporatingnetworksinaprobabilisticgraphicalmodeltofinddriversforcomplexhumandiseases
AT goldenberganna incorporatingnetworksinaprobabilisticgraphicalmodeltofinddriversforcomplexhumandiseases