Cargando…
Inferring gene and protein interactions using PubMed citations and consensus Bayesian networks
The PubMed database offers an extensive set of publication data that can be useful, yet inherently complex to use without automated computational techniques. Data repositories such as the Genomic Data Commons (GDC) and the Gene Expression Omnibus (GEO) offer experimental data storage and retrieval a...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5648141/ https://www.ncbi.nlm.nih.gov/pubmed/29049295 http://dx.doi.org/10.1371/journal.pone.0186004 |
_version_ | 1783272344792334336 |
---|---|
author | Deeter, Anthony Dalman, Mark Haddad, Joseph Duan, Zhong-Hui |
author_facet | Deeter, Anthony Dalman, Mark Haddad, Joseph Duan, Zhong-Hui |
author_sort | Deeter, Anthony |
collection | PubMed |
description | The PubMed database offers an extensive set of publication data that can be useful, yet inherently complex to use without automated computational techniques. Data repositories such as the Genomic Data Commons (GDC) and the Gene Expression Omnibus (GEO) offer experimental data storage and retrieval as well as curated gene expression profiles. Genetic interaction databases, including Reactome and Ingenuity Pathway Analysis, offer pathway and experiment data analysis using data curated from these publications and data repositories. We have created a method to generate and analyze consensus networks, inferring potential gene interactions, using large numbers of Bayesian networks generated by data mining publications in the PubMed database. Through the concept of network resolution, these consensus networks can be tailored to represent possible genetic interactions. We designed a set of experiments to confirm that our method is stable across variation in both sample and topological input sizes. Using gene product interactions from the KEGG pathway database and data mining PubMed publication abstracts, we verify that regardless of the network resolution or the inferred consensus network, our method is capable of inferring meaningful gene interactions through consensus Bayesian network generation with multiple, randomized topological orderings. Our method can not only confirm the existence of currently accepted interactions, but has the potential to hypothesize new ones as well. We show our method confirms the existence of known gene interactions such as JAK-STAT-PI3K-AKT-mTOR, infers novel gene interactions such as RAS- Bcl-2 and RAS-AKT, and found significant pathway-pathway interactions between the JAK-STAT signaling and Cardiac Muscle Contraction KEGG pathways. |
format | Online Article Text |
id | pubmed-5648141 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-56481412017-11-03 Inferring gene and protein interactions using PubMed citations and consensus Bayesian networks Deeter, Anthony Dalman, Mark Haddad, Joseph Duan, Zhong-Hui PLoS One Research Article The PubMed database offers an extensive set of publication data that can be useful, yet inherently complex to use without automated computational techniques. Data repositories such as the Genomic Data Commons (GDC) and the Gene Expression Omnibus (GEO) offer experimental data storage and retrieval as well as curated gene expression profiles. Genetic interaction databases, including Reactome and Ingenuity Pathway Analysis, offer pathway and experiment data analysis using data curated from these publications and data repositories. We have created a method to generate and analyze consensus networks, inferring potential gene interactions, using large numbers of Bayesian networks generated by data mining publications in the PubMed database. Through the concept of network resolution, these consensus networks can be tailored to represent possible genetic interactions. We designed a set of experiments to confirm that our method is stable across variation in both sample and topological input sizes. Using gene product interactions from the KEGG pathway database and data mining PubMed publication abstracts, we verify that regardless of the network resolution or the inferred consensus network, our method is capable of inferring meaningful gene interactions through consensus Bayesian network generation with multiple, randomized topological orderings. Our method can not only confirm the existence of currently accepted interactions, but has the potential to hypothesize new ones as well. We show our method confirms the existence of known gene interactions such as JAK-STAT-PI3K-AKT-mTOR, infers novel gene interactions such as RAS- Bcl-2 and RAS-AKT, and found significant pathway-pathway interactions between the JAK-STAT signaling and Cardiac Muscle Contraction KEGG pathways. Public Library of Science 2017-10-19 /pmc/articles/PMC5648141/ /pubmed/29049295 http://dx.doi.org/10.1371/journal.pone.0186004 Text en © 2017 Deeter et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Deeter, Anthony Dalman, Mark Haddad, Joseph Duan, Zhong-Hui Inferring gene and protein interactions using PubMed citations and consensus Bayesian networks |
title | Inferring gene and protein interactions using PubMed citations and consensus Bayesian networks |
title_full | Inferring gene and protein interactions using PubMed citations and consensus Bayesian networks |
title_fullStr | Inferring gene and protein interactions using PubMed citations and consensus Bayesian networks |
title_full_unstemmed | Inferring gene and protein interactions using PubMed citations and consensus Bayesian networks |
title_short | Inferring gene and protein interactions using PubMed citations and consensus Bayesian networks |
title_sort | inferring gene and protein interactions using pubmed citations and consensus bayesian networks |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5648141/ https://www.ncbi.nlm.nih.gov/pubmed/29049295 http://dx.doi.org/10.1371/journal.pone.0186004 |
work_keys_str_mv | AT deeteranthony inferringgeneandproteininteractionsusingpubmedcitationsandconsensusbayesiannetworks AT dalmanmark inferringgeneandproteininteractionsusingpubmedcitationsandconsensusbayesiannetworks AT haddadjoseph inferringgeneandproteininteractionsusingpubmedcitationsandconsensusbayesiannetworks AT duanzhonghui inferringgeneandproteininteractionsusingpubmedcitationsandconsensusbayesiannetworks |