Cargando…

Ontology-based Brucella vaccine literature indexing and systematic analysis of gene-vaccine association network

BACKGROUND: Vaccine literature indexing is poorly performed in PubMed due to limited hierarchy of Medical Subject Headings (MeSH) annotation in the vaccine field. Vaccine Ontology (VO) is a community-based biomedical ontology that represents various vaccines and their relations. SciMiner is an in-ho...

Descripción completa

Detalles Bibliográficos
Autores principales: Hur, Junguk, Xiang, Zuoshuang, Feldman, Eva L, He, Yongqun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3180695/
https://www.ncbi.nlm.nih.gov/pubmed/21871085
http://dx.doi.org/10.1186/1471-2172-12-49
_version_ 1782212680342306816
author Hur, Junguk
Xiang, Zuoshuang
Feldman, Eva L
He, Yongqun
author_facet Hur, Junguk
Xiang, Zuoshuang
Feldman, Eva L
He, Yongqun
author_sort Hur, Junguk
collection PubMed
description BACKGROUND: Vaccine literature indexing is poorly performed in PubMed due to limited hierarchy of Medical Subject Headings (MeSH) annotation in the vaccine field. Vaccine Ontology (VO) is a community-based biomedical ontology that represents various vaccines and their relations. SciMiner is an in-house literature mining system that supports literature indexing and gene name tagging. We hypothesize that application of VO in SciMiner will aid vaccine literature indexing and mining of vaccine-gene interaction networks. As a test case, we have examined vaccines for Brucella, the causative agent of brucellosis in humans and animals. RESULTS: The VO-based SciMiner (VO-SciMiner) was developed to incorporate a total of 67 Brucella vaccine terms. A set of rules for term expansion of VO terms were learned from training data, consisting of 90 biomedical articles related to Brucella vaccine terms. VO-SciMiner demonstrated high recall (91%) and precision (99%) from testing a separate set of 100 manually selected biomedical articles. VO-SciMiner indexing exhibited superior performance in retrieving Brucella vaccine-related papers over that obtained with MeSH-based PubMed literature search. For example, a VO-SciMiner search of "live attenuated Brucella vaccine" returned 922 hits as of April 20, 2011, while a PubMed search of the same query resulted in only 74 hits. Using the abstracts of 14,947 Brucella-related papers, VO-SciMiner identified 140 Brucella genes associated with Brucella vaccines. These genes included known protective antigens, virulence factors, and genes closely related to Brucella vaccines. These VO-interacting Brucella genes were significantly over-represented in biological functional categories, including metabolite transport and metabolism, replication and repair, cell wall biogenesis, intracellular trafficking and secretion, posttranslational modification, and chaperones. Furthermore, a comprehensive interaction network of Brucella vaccines and genes were identified. The asserted and inferred VO hierarchies provide semantic support for inferring novel knowledge of association of vaccines and genes from the retrieved data. New hypotheses were generated based on this analysis approach. CONCLUSION: VO-SciMiner can be used to improve the efficiency for PubMed searching in the vaccine domain.
format Online
Article
Text
id pubmed-3180695
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31806952011-09-27 Ontology-based Brucella vaccine literature indexing and systematic analysis of gene-vaccine association network Hur, Junguk Xiang, Zuoshuang Feldman, Eva L He, Yongqun BMC Immunol Research Article BACKGROUND: Vaccine literature indexing is poorly performed in PubMed due to limited hierarchy of Medical Subject Headings (MeSH) annotation in the vaccine field. Vaccine Ontology (VO) is a community-based biomedical ontology that represents various vaccines and their relations. SciMiner is an in-house literature mining system that supports literature indexing and gene name tagging. We hypothesize that application of VO in SciMiner will aid vaccine literature indexing and mining of vaccine-gene interaction networks. As a test case, we have examined vaccines for Brucella, the causative agent of brucellosis in humans and animals. RESULTS: The VO-based SciMiner (VO-SciMiner) was developed to incorporate a total of 67 Brucella vaccine terms. A set of rules for term expansion of VO terms were learned from training data, consisting of 90 biomedical articles related to Brucella vaccine terms. VO-SciMiner demonstrated high recall (91%) and precision (99%) from testing a separate set of 100 manually selected biomedical articles. VO-SciMiner indexing exhibited superior performance in retrieving Brucella vaccine-related papers over that obtained with MeSH-based PubMed literature search. For example, a VO-SciMiner search of "live attenuated Brucella vaccine" returned 922 hits as of April 20, 2011, while a PubMed search of the same query resulted in only 74 hits. Using the abstracts of 14,947 Brucella-related papers, VO-SciMiner identified 140 Brucella genes associated with Brucella vaccines. These genes included known protective antigens, virulence factors, and genes closely related to Brucella vaccines. These VO-interacting Brucella genes were significantly over-represented in biological functional categories, including metabolite transport and metabolism, replication and repair, cell wall biogenesis, intracellular trafficking and secretion, posttranslational modification, and chaperones. Furthermore, a comprehensive interaction network of Brucella vaccines and genes were identified. The asserted and inferred VO hierarchies provide semantic support for inferring novel knowledge of association of vaccines and genes from the retrieved data. New hypotheses were generated based on this analysis approach. CONCLUSION: VO-SciMiner can be used to improve the efficiency for PubMed searching in the vaccine domain. BioMed Central 2011-08-26 /pmc/articles/PMC3180695/ /pubmed/21871085 http://dx.doi.org/10.1186/1471-2172-12-49 Text en Copyright ©2011 Hur et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Hur, Junguk
Xiang, Zuoshuang
Feldman, Eva L
He, Yongqun
Ontology-based Brucella vaccine literature indexing and systematic analysis of gene-vaccine association network
title Ontology-based Brucella vaccine literature indexing and systematic analysis of gene-vaccine association network
title_full Ontology-based Brucella vaccine literature indexing and systematic analysis of gene-vaccine association network
title_fullStr Ontology-based Brucella vaccine literature indexing and systematic analysis of gene-vaccine association network
title_full_unstemmed Ontology-based Brucella vaccine literature indexing and systematic analysis of gene-vaccine association network
title_short Ontology-based Brucella vaccine literature indexing and systematic analysis of gene-vaccine association network
title_sort ontology-based brucella vaccine literature indexing and systematic analysis of gene-vaccine association network
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3180695/
https://www.ncbi.nlm.nih.gov/pubmed/21871085
http://dx.doi.org/10.1186/1471-2172-12-49
work_keys_str_mv AT hurjunguk ontologybasedbrucellavaccineliteratureindexingandsystematicanalysisofgenevaccineassociationnetwork
AT xiangzuoshuang ontologybasedbrucellavaccineliteratureindexingandsystematicanalysisofgenevaccineassociationnetwork
AT feldmaneval ontologybasedbrucellavaccineliteratureindexingandsystematicanalysisofgenevaccineassociationnetwork
AT heyongqun ontologybasedbrucellavaccineliteratureindexingandsystematicanalysisofgenevaccineassociationnetwork