
Whole genome identification of Mycobacterium tuberculosis vaccine candidates by comprehensive data mining and bioinformatic analyses


Bibliographic Details
Main authors: Zvi, Anat, Ariel, Naomi, Fulkerson, John, Sadoff, Jerald C, Shafferman, Avigdor
Format: Text
Language: English
Published: BioMed Central 2008
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2442614/
https://www.ncbi.nlm.nih.gov/pubmed/18505592
http://dx.doi.org/10.1186/1755-8794-1-18
collection PubMed
description BACKGROUND: Mycobacterium tuberculosis, the causative agent of tuberculosis (TB), infects ~8 million people annually, culminating in ~2 million deaths. Moreover, about one third of the population is latently infected, 10% of whom develop disease during their lifetime. Currently approved prophylactic TB vaccines (BCG and derivatives thereof) are of variable efficiency in adult protection against pulmonary TB (0%–80%), and are directed essentially against early-phase infection.
METHODS: A genome-scale dataset was constructed by analyzing published data of: (1) global gene expression studies under conditions that simulate intra-macrophage stress, dormancy, persistence and/or reactivation; (2) cellular and humoral immunity, and vaccine potential. This information was compiled along with revised annotation/bioinformatic characterization of selected gene products and in silico mapping of T-cell epitopes. Protocols for scoring, ranking and prioritization of the antigens were developed and applied.
RESULTS: Cross-matching of literature and in silico-derived data, in conjunction with the prioritization scheme and biological rationale, allowed for selection of 189 putative vaccine candidates from the entire genome. Within the 189 set, the relative distribution of antigens in 3 functional categories differs significantly from their distribution in the whole genome, with reduction in the Conserved hypothetical category (due to improved annotation) and enrichment in Lipid and in Virulence categories. Other prominent representatives in the 189 set are the PE/PPE proteins; iron sequestration, nitroreductases and proteases, all within the Intermediary metabolism and respiration category; ESX secretion systems, resuscitation promoting factors and lipoproteins, all within the Cell wall category. Application of a ranking scheme based on qualitative and quantitative scores resulted in a list of 45 best-scoring antigens, of which: 74% belong to the dormancy/reactivation/resuscitation classes; 30% belong to the Cell wall category; 13% are classical vaccine candidates; 9% are categorized Conserved hypotheticals, all potentially very potent T-cell antigens.
CONCLUSION: The comprehensive literature and in silico-based analyses allowed for the selection of a repertoire of 189 vaccine candidates, out of the whole-genome 3989 ORF products. This repertoire, which was ranked to generate a list of 45 top-hits antigens, is a platform for selection of genes covering all stages of M. tuberculosis infection, to be incorporated in rBCG or subunit-based vaccines.
format Text
id pubmed-2442614
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-2442614 2008-07-02 BMC Med Genomics Research Article BioMed Central 2008-05-28 /pmc/articles/PMC2442614/ /pubmed/18505592 http://dx.doi.org/10.1186/1755-8794-1-18 Text en
Copyright © 2008 Zvi et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title Whole genome identification of Mycobacterium tuberculosis vaccine candidates by comprehensive data mining and bioinformatic analyses
topic Research Article