Cargando…

Inferring gene and protein interactions using PubMed citations and consensus Bayesian networks

The PubMed database offers an extensive set of publication data that can be useful, yet inherently complex to use without automated computational techniques. Data repositories such as the Genomic Data Commons (GDC) and the Gene Expression Omnibus (GEO) offer experimental data storage and retrieval a...

Descripción completa

Detalles Bibliográficos
Autores principales: Deeter, Anthony, Dalman, Mark, Haddad, Joseph, Duan, Zhong-Hui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5648141/
https://www.ncbi.nlm.nih.gov/pubmed/29049295
http://dx.doi.org/10.1371/journal.pone.0186004
_version_ 1783272344792334336
author Deeter, Anthony
Dalman, Mark
Haddad, Joseph
Duan, Zhong-Hui
author_facet Deeter, Anthony
Dalman, Mark
Haddad, Joseph
Duan, Zhong-Hui
author_sort Deeter, Anthony
collection PubMed
description The PubMed database offers an extensive set of publication data that can be useful, yet inherently complex to use without automated computational techniques. Data repositories such as the Genomic Data Commons (GDC) and the Gene Expression Omnibus (GEO) offer experimental data storage and retrieval as well as curated gene expression profiles. Genetic interaction databases, including Reactome and Ingenuity Pathway Analysis, offer pathway and experiment data analysis using data curated from these publications and data repositories. We have created a method to generate and analyze consensus networks, inferring potential gene interactions, using large numbers of Bayesian networks generated by data mining publications in the PubMed database. Through the concept of network resolution, these consensus networks can be tailored to represent possible genetic interactions. We designed a set of experiments to confirm that our method is stable across variation in both sample and topological input sizes. Using gene product interactions from the KEGG pathway database and data mining PubMed publication abstracts, we verify that regardless of the network resolution or the inferred consensus network, our method is capable of inferring meaningful gene interactions through consensus Bayesian network generation with multiple, randomized topological orderings. Our method can not only confirm the existence of currently accepted interactions, but has the potential to hypothesize new ones as well. We show our method confirms the existence of known gene interactions such as JAK-STAT-PI3K-AKT-mTOR, infers novel gene interactions such as RAS- Bcl-2 and RAS-AKT, and found significant pathway-pathway interactions between the JAK-STAT signaling and Cardiac Muscle Contraction KEGG pathways.
format Online
Article
Text
id pubmed-5648141
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-56481412017-11-03 Inferring gene and protein interactions using PubMed citations and consensus Bayesian networks Deeter, Anthony Dalman, Mark Haddad, Joseph Duan, Zhong-Hui PLoS One Research Article The PubMed database offers an extensive set of publication data that can be useful, yet inherently complex to use without automated computational techniques. Data repositories such as the Genomic Data Commons (GDC) and the Gene Expression Omnibus (GEO) offer experimental data storage and retrieval as well as curated gene expression profiles. Genetic interaction databases, including Reactome and Ingenuity Pathway Analysis, offer pathway and experiment data analysis using data curated from these publications and data repositories. We have created a method to generate and analyze consensus networks, inferring potential gene interactions, using large numbers of Bayesian networks generated by data mining publications in the PubMed database. Through the concept of network resolution, these consensus networks can be tailored to represent possible genetic interactions. We designed a set of experiments to confirm that our method is stable across variation in both sample and topological input sizes. Using gene product interactions from the KEGG pathway database and data mining PubMed publication abstracts, we verify that regardless of the network resolution or the inferred consensus network, our method is capable of inferring meaningful gene interactions through consensus Bayesian network generation with multiple, randomized topological orderings. Our method can not only confirm the existence of currently accepted interactions, but has the potential to hypothesize new ones as well. We show our method confirms the existence of known gene interactions such as JAK-STAT-PI3K-AKT-mTOR, infers novel gene interactions such as RAS- Bcl-2 and RAS-AKT, and found significant pathway-pathway interactions between the JAK-STAT signaling and Cardiac Muscle Contraction KEGG pathways. Public Library of Science 2017-10-19 /pmc/articles/PMC5648141/ /pubmed/29049295 http://dx.doi.org/10.1371/journal.pone.0186004 Text en © 2017 Deeter et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Deeter, Anthony
Dalman, Mark
Haddad, Joseph
Duan, Zhong-Hui
Inferring gene and protein interactions using PubMed citations and consensus Bayesian networks
title Inferring gene and protein interactions using PubMed citations and consensus Bayesian networks
title_full Inferring gene and protein interactions using PubMed citations and consensus Bayesian networks
title_fullStr Inferring gene and protein interactions using PubMed citations and consensus Bayesian networks
title_full_unstemmed Inferring gene and protein interactions using PubMed citations and consensus Bayesian networks
title_short Inferring gene and protein interactions using PubMed citations and consensus Bayesian networks
title_sort inferring gene and protein interactions using pubmed citations and consensus bayesian networks
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5648141/
https://www.ncbi.nlm.nih.gov/pubmed/29049295
http://dx.doi.org/10.1371/journal.pone.0186004
work_keys_str_mv AT deeteranthony inferringgeneandproteininteractionsusingpubmedcitationsandconsensusbayesiannetworks
AT dalmanmark inferringgeneandproteininteractionsusingpubmedcitationsandconsensusbayesiannetworks
AT haddadjoseph inferringgeneandproteininteractionsusingpubmedcitationsandconsensusbayesiannetworks
AT duanzhonghui inferringgeneandproteininteractionsusingpubmedcitationsandconsensusbayesiannetworks