Cargando…
Ontology-based literature mining of E. coli vaccine-associated gene interaction networks
BACKGROUND: Pathogenic Escherichia coli infections cause various diseases in humans and many animal species. However, with extensive E. coli vaccine research, we are still unable to fully protect ourselves against E. coli infections. To more rational development of effective and safe E. coli vaccine...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5348867/ https://www.ncbi.nlm.nih.gov/pubmed/28288685 http://dx.doi.org/10.1186/s13326-017-0122-4 |
_version_ | 1782514342036504576 |
---|---|
author | Hur, Junguk Özgür, Arzucan He, Yongqun |
author_facet | Hur, Junguk Özgür, Arzucan He, Yongqun |
author_sort | Hur, Junguk |
collection | PubMed |
description | BACKGROUND: Pathogenic Escherichia coli infections cause various diseases in humans and many animal species. However, with extensive E. coli vaccine research, we are still unable to fully protect ourselves against E. coli infections. To more rational development of effective and safe E. coli vaccine, it is important to better understand E. coli vaccine-associated gene interaction networks. METHODS: In this study, we first extended the Vaccine Ontology (VO) to semantically represent various E. coli vaccines and genes used in the vaccine development. We also normalized E. coli gene names compiled from the annotations of various E. coli strains using a pan-genome-based annotation strategy. The Interaction Network Ontology (INO) includes a hierarchy of various interaction-related keywords useful for literature mining. Using VO, INO, and normalized E. coli gene names, we applied an ontology-based SciMiner literature mining strategy to mine all PubMed abstracts and retrieve E. coli vaccine-associated E. coli gene interactions. Four centrality metrics (i.e., degree, eigenvector, closeness, and betweenness) were calculated for identifying highly ranked genes and interaction types. RESULTS: Using vaccine-related PubMed abstracts, our study identified 11,350 sentences that contain 88 unique INO interactions types and 1,781 unique E. coli genes. Each sentence contained at least one interaction type and two unique E. coli genes. An E. coli gene interaction network of genes and INO interaction types was created. From this big network, a sub-network consisting of 5 E. coli vaccine genes, including carA, carB, fimH, fepA, and vat, and 62 other E. coli genes, and 25 INO interaction types was identified. While many interaction types represent direct interactions between two indicated genes, our study has also shown that many of these retrieved interaction types are indirect in that the two genes participated in the specified interaction process in a required but indirect process. Our centrality analysis of these gene interaction networks identified top ranked E. coli genes and 6 INO interaction types (e.g., regulation and gene expression). CONCLUSIONS: Vaccine-related E. coli gene-gene interaction network was constructed using ontology-based literature mining strategy, which identified important E. coli vaccine genes and their interactions with other genes through specific interaction types. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13326-017-0122-4) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5348867 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-53488672017-03-14 Ontology-based literature mining of E. coli vaccine-associated gene interaction networks Hur, Junguk Özgür, Arzucan He, Yongqun J Biomed Semantics Research BACKGROUND: Pathogenic Escherichia coli infections cause various diseases in humans and many animal species. However, with extensive E. coli vaccine research, we are still unable to fully protect ourselves against E. coli infections. To more rational development of effective and safe E. coli vaccine, it is important to better understand E. coli vaccine-associated gene interaction networks. METHODS: In this study, we first extended the Vaccine Ontology (VO) to semantically represent various E. coli vaccines and genes used in the vaccine development. We also normalized E. coli gene names compiled from the annotations of various E. coli strains using a pan-genome-based annotation strategy. The Interaction Network Ontology (INO) includes a hierarchy of various interaction-related keywords useful for literature mining. Using VO, INO, and normalized E. coli gene names, we applied an ontology-based SciMiner literature mining strategy to mine all PubMed abstracts and retrieve E. coli vaccine-associated E. coli gene interactions. Four centrality metrics (i.e., degree, eigenvector, closeness, and betweenness) were calculated for identifying highly ranked genes and interaction types. RESULTS: Using vaccine-related PubMed abstracts, our study identified 11,350 sentences that contain 88 unique INO interactions types and 1,781 unique E. coli genes. Each sentence contained at least one interaction type and two unique E. coli genes. An E. coli gene interaction network of genes and INO interaction types was created. From this big network, a sub-network consisting of 5 E. coli vaccine genes, including carA, carB, fimH, fepA, and vat, and 62 other E. coli genes, and 25 INO interaction types was identified. While many interaction types represent direct interactions between two indicated genes, our study has also shown that many of these retrieved interaction types are indirect in that the two genes participated in the specified interaction process in a required but indirect process. Our centrality analysis of these gene interaction networks identified top ranked E. coli genes and 6 INO interaction types (e.g., regulation and gene expression). CONCLUSIONS: Vaccine-related E. coli gene-gene interaction network was constructed using ontology-based literature mining strategy, which identified important E. coli vaccine genes and their interactions with other genes through specific interaction types. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13326-017-0122-4) contains supplementary material, which is available to authorized users. BioMed Central 2017-03-14 /pmc/articles/PMC5348867/ /pubmed/28288685 http://dx.doi.org/10.1186/s13326-017-0122-4 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Hur, Junguk Özgür, Arzucan He, Yongqun Ontology-based literature mining of E. coli vaccine-associated gene interaction networks |
title | Ontology-based literature mining of E. coli vaccine-associated gene interaction networks |
title_full | Ontology-based literature mining of E. coli vaccine-associated gene interaction networks |
title_fullStr | Ontology-based literature mining of E. coli vaccine-associated gene interaction networks |
title_full_unstemmed | Ontology-based literature mining of E. coli vaccine-associated gene interaction networks |
title_short | Ontology-based literature mining of E. coli vaccine-associated gene interaction networks |
title_sort | ontology-based literature mining of e. coli vaccine-associated gene interaction networks |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5348867/ https://www.ncbi.nlm.nih.gov/pubmed/28288685 http://dx.doi.org/10.1186/s13326-017-0122-4 |
work_keys_str_mv | AT hurjunguk ontologybasedliteratureminingofecolivaccineassociatedgeneinteractionnetworks AT ozgurarzucan ontologybasedliteratureminingofecolivaccineassociatedgeneinteractionnetworks AT heyongqun ontologybasedliteratureminingofecolivaccineassociatedgeneinteractionnetworks |