Cargando…

Ontology-based literature mining of E. coli vaccine-associated gene interaction networks

BACKGROUND: Pathogenic Escherichia coli infections cause various diseases in humans and many animal species. However, with extensive E. coli vaccine research, we are still unable to fully protect ourselves against E. coli infections. To more rational development of effective and safe E. coli vaccine...

Descripción completa

Detalles Bibliográficos
Autores principales: Hur, Junguk, Özgür, Arzucan, He, Yongqun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5348867/
https://www.ncbi.nlm.nih.gov/pubmed/28288685
http://dx.doi.org/10.1186/s13326-017-0122-4
_version_ 1782514342036504576
author Hur, Junguk
Özgür, Arzucan
He, Yongqun
author_facet Hur, Junguk
Özgür, Arzucan
He, Yongqun
author_sort Hur, Junguk
collection PubMed
description BACKGROUND: Pathogenic Escherichia coli infections cause various diseases in humans and many animal species. However, with extensive E. coli vaccine research, we are still unable to fully protect ourselves against E. coli infections. To more rational development of effective and safe E. coli vaccine, it is important to better understand E. coli vaccine-associated gene interaction networks. METHODS: In this study, we first extended the Vaccine Ontology (VO) to semantically represent various E. coli vaccines and genes used in the vaccine development. We also normalized E. coli gene names compiled from the annotations of various E. coli strains using a pan-genome-based annotation strategy. The Interaction Network Ontology (INO) includes a hierarchy of various interaction-related keywords useful for literature mining. Using VO, INO, and normalized E. coli gene names, we applied an ontology-based SciMiner literature mining strategy to mine all PubMed abstracts and retrieve E. coli vaccine-associated E. coli gene interactions. Four centrality metrics (i.e., degree, eigenvector, closeness, and betweenness) were calculated for identifying highly ranked genes and interaction types. RESULTS: Using vaccine-related PubMed abstracts, our study identified 11,350 sentences that contain 88 unique INO interactions types and 1,781 unique E. coli genes. Each sentence contained at least one interaction type and two unique E. coli genes. An E. coli gene interaction network of genes and INO interaction types was created. From this big network, a sub-network consisting of 5 E. coli vaccine genes, including carA, carB, fimH, fepA, and vat, and 62 other E. coli genes, and 25 INO interaction types was identified. While many interaction types represent direct interactions between two indicated genes, our study has also shown that many of these retrieved interaction types are indirect in that the two genes participated in the specified interaction process in a required but indirect process. Our centrality analysis of these gene interaction networks identified top ranked E. coli genes and 6 INO interaction types (e.g., regulation and gene expression). CONCLUSIONS: Vaccine-related E. coli gene-gene interaction network was constructed using ontology-based literature mining strategy, which identified important E. coli vaccine genes and their interactions with other genes through specific interaction types. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13326-017-0122-4) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5348867
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-53488672017-03-14 Ontology-based literature mining of E. coli vaccine-associated gene interaction networks Hur, Junguk Özgür, Arzucan He, Yongqun J Biomed Semantics Research BACKGROUND: Pathogenic Escherichia coli infections cause various diseases in humans and many animal species. However, with extensive E. coli vaccine research, we are still unable to fully protect ourselves against E. coli infections. To more rational development of effective and safe E. coli vaccine, it is important to better understand E. coli vaccine-associated gene interaction networks. METHODS: In this study, we first extended the Vaccine Ontology (VO) to semantically represent various E. coli vaccines and genes used in the vaccine development. We also normalized E. coli gene names compiled from the annotations of various E. coli strains using a pan-genome-based annotation strategy. The Interaction Network Ontology (INO) includes a hierarchy of various interaction-related keywords useful for literature mining. Using VO, INO, and normalized E. coli gene names, we applied an ontology-based SciMiner literature mining strategy to mine all PubMed abstracts and retrieve E. coli vaccine-associated E. coli gene interactions. Four centrality metrics (i.e., degree, eigenvector, closeness, and betweenness) were calculated for identifying highly ranked genes and interaction types. RESULTS: Using vaccine-related PubMed abstracts, our study identified 11,350 sentences that contain 88 unique INO interactions types and 1,781 unique E. coli genes. Each sentence contained at least one interaction type and two unique E. coli genes. An E. coli gene interaction network of genes and INO interaction types was created. From this big network, a sub-network consisting of 5 E. coli vaccine genes, including carA, carB, fimH, fepA, and vat, and 62 other E. coli genes, and 25 INO interaction types was identified. While many interaction types represent direct interactions between two indicated genes, our study has also shown that many of these retrieved interaction types are indirect in that the two genes participated in the specified interaction process in a required but indirect process. Our centrality analysis of these gene interaction networks identified top ranked E. coli genes and 6 INO interaction types (e.g., regulation and gene expression). CONCLUSIONS: Vaccine-related E. coli gene-gene interaction network was constructed using ontology-based literature mining strategy, which identified important E. coli vaccine genes and their interactions with other genes through specific interaction types. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13326-017-0122-4) contains supplementary material, which is available to authorized users. BioMed Central 2017-03-14 /pmc/articles/PMC5348867/ /pubmed/28288685 http://dx.doi.org/10.1186/s13326-017-0122-4 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Hur, Junguk
Özgür, Arzucan
He, Yongqun
Ontology-based literature mining of E. coli vaccine-associated gene interaction networks
title Ontology-based literature mining of E. coli vaccine-associated gene interaction networks
title_full Ontology-based literature mining of E. coli vaccine-associated gene interaction networks
title_fullStr Ontology-based literature mining of E. coli vaccine-associated gene interaction networks
title_full_unstemmed Ontology-based literature mining of E. coli vaccine-associated gene interaction networks
title_short Ontology-based literature mining of E. coli vaccine-associated gene interaction networks
title_sort ontology-based literature mining of e. coli vaccine-associated gene interaction networks
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5348867/
https://www.ncbi.nlm.nih.gov/pubmed/28288685
http://dx.doi.org/10.1186/s13326-017-0122-4
work_keys_str_mv AT hurjunguk ontologybasedliteratureminingofecolivaccineassociatedgeneinteractionnetworks
AT ozgurarzucan ontologybasedliteratureminingofecolivaccineassociatedgeneinteractionnetworks
AT heyongqun ontologybasedliteratureminingofecolivaccineassociatedgeneinteractionnetworks