Cargando…
Incorporating networks in a probabilistic graphical model to find drivers for complex human diseases
Discovering genetic mechanisms driving complex diseases is a hard problem. Existing methods often lack power to identify the set of responsible genes. Protein-protein interaction networks have been shown to boost power when detecting gene-disease associations. We introduce a Bayesian framework, Conf...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5638204/ https://www.ncbi.nlm.nih.gov/pubmed/29023450 http://dx.doi.org/10.1371/journal.pcbi.1005580 |
_version_ | 1783270705602756608 |
---|---|
author | Mezlini, Aziz M. Goldenberg, Anna |
author_facet | Mezlini, Aziz M. Goldenberg, Anna |
author_sort | Mezlini, Aziz M. |
collection | PubMed |
description | Discovering genetic mechanisms driving complex diseases is a hard problem. Existing methods often lack power to identify the set of responsible genes. Protein-protein interaction networks have been shown to boost power when detecting gene-disease associations. We introduce a Bayesian framework, Conflux, to find disease associated genes from exome sequencing data using networks as a prior. There are two main advantages to using networks within a probabilistic graphical model. First, networks are noisy and incomplete, a substantial impediment to gene discovery. Incorporating networks into the structure of a probabilistic models for gene inference has less impact on the solution than relying on the noisy network structure directly. Second, using a Bayesian framework we can keep track of the uncertainty of each gene being associated with the phenotype rather than returning a fixed list of genes. We first show that using networks clearly improves gene detection compared to individual gene testing. We then show consistently improved performance of Conflux compared to the state-of-the-art diffusion network-based method Hotnet2 and a variety of other network and variant aggregation methods, using randomly generated and literature-reported gene sets. We test Hotnet2 and Conflux on several network configurations to reveal biases and patterns of false positives and false negatives in each case. Our experiments show that our novel Bayesian framework Conflux incorporates many of the advantages of the current state-of-the-art methods, while offering more flexibility and improved power in many gene-disease association scenarios. |
format | Online Article Text |
id | pubmed-5638204 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-56382042017-11-03 Incorporating networks in a probabilistic graphical model to find drivers for complex human diseases Mezlini, Aziz M. Goldenberg, Anna PLoS Comput Biol Research Article Discovering genetic mechanisms driving complex diseases is a hard problem. Existing methods often lack power to identify the set of responsible genes. Protein-protein interaction networks have been shown to boost power when detecting gene-disease associations. We introduce a Bayesian framework, Conflux, to find disease associated genes from exome sequencing data using networks as a prior. There are two main advantages to using networks within a probabilistic graphical model. First, networks are noisy and incomplete, a substantial impediment to gene discovery. Incorporating networks into the structure of a probabilistic models for gene inference has less impact on the solution than relying on the noisy network structure directly. Second, using a Bayesian framework we can keep track of the uncertainty of each gene being associated with the phenotype rather than returning a fixed list of genes. We first show that using networks clearly improves gene detection compared to individual gene testing. We then show consistently improved performance of Conflux compared to the state-of-the-art diffusion network-based method Hotnet2 and a variety of other network and variant aggregation methods, using randomly generated and literature-reported gene sets. We test Hotnet2 and Conflux on several network configurations to reveal biases and patterns of false positives and false negatives in each case. Our experiments show that our novel Bayesian framework Conflux incorporates many of the advantages of the current state-of-the-art methods, while offering more flexibility and improved power in many gene-disease association scenarios. Public Library of Science 2017-10-12 /pmc/articles/PMC5638204/ /pubmed/29023450 http://dx.doi.org/10.1371/journal.pcbi.1005580 Text en © 2017 Mezlini, Goldenberg http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Mezlini, Aziz M. Goldenberg, Anna Incorporating networks in a probabilistic graphical model to find drivers for complex human diseases |
title | Incorporating networks in a probabilistic graphical model to find drivers for complex human diseases |
title_full | Incorporating networks in a probabilistic graphical model to find drivers for complex human diseases |
title_fullStr | Incorporating networks in a probabilistic graphical model to find drivers for complex human diseases |
title_full_unstemmed | Incorporating networks in a probabilistic graphical model to find drivers for complex human diseases |
title_short | Incorporating networks in a probabilistic graphical model to find drivers for complex human diseases |
title_sort | incorporating networks in a probabilistic graphical model to find drivers for complex human diseases |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5638204/ https://www.ncbi.nlm.nih.gov/pubmed/29023450 http://dx.doi.org/10.1371/journal.pcbi.1005580 |
work_keys_str_mv | AT mezliniazizm incorporatingnetworksinaprobabilisticgraphicalmodeltofinddriversforcomplexhumandiseases AT goldenberganna incorporatingnetworksinaprobabilisticgraphicalmodeltofinddriversforcomplexhumandiseases |