Cargando…

Reduced Set of Virulence Genes Allows High Accuracy Prediction of Bacterial Pathogenicity in Humans

Although there have been great advances in understanding bacterial pathogenesis, there is still a lack of integrative information about what makes a bacterium a human pathogen. The advent of high-throughput sequencing technologies has dramatically increased the amount of completed bacterial genomes,...

Descripción completa

Detalles Bibliográficos
Autores principales: Iraola, Gregorio, Vazquez, Gustavo, Spangenberg, Lucía, Naya, Hugo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3412846/
https://www.ncbi.nlm.nih.gov/pubmed/22916122
http://dx.doi.org/10.1371/journal.pone.0042144
_version_ 1782240003217162240
author Iraola, Gregorio
Vazquez, Gustavo
Spangenberg, Lucía
Naya, Hugo
author_facet Iraola, Gregorio
Vazquez, Gustavo
Spangenberg, Lucía
Naya, Hugo
author_sort Iraola, Gregorio
collection PubMed
description Although there have been great advances in understanding bacterial pathogenesis, there is still a lack of integrative information about what makes a bacterium a human pathogen. The advent of high-throughput sequencing technologies has dramatically increased the amount of completed bacterial genomes, for both known human pathogenic and non-pathogenic strains; this information is now available to investigate genetic features that determine pathogenic phenotypes in bacteria. In this work we determined presence/absence patterns of [Image: see text] different virulence-related genes among more than [Image: see text] finished bacterial genomes from both human pathogenic and non-pathogenic strains, belonging to different taxonomic groups (i.e: Actinobacteria, Gammaproteobacteria, Firmicutes, etc.). An accuracy of 95% using a cross-fold validation scheme with in-fold feature selection is obtained when classifying human pathogens and non-pathogens. A reduced subset of highly informative genes ([Image: see text]) is presented and applied to an external validation set. The statistical model was implemented in the BacFier v1.0 software (freely available at [Image: see text]), that displays not only the prediction (pathogen/non-pathogen) and an associated probability for pathogenicity, but also the presence/absence vector for the analyzed genes, so it is possible to decipher the subset of virulence genes responsible for the classification on the analyzed genome. Furthermore, we discuss the biological relevance for bacterial pathogenesis of the core set of genes, corresponding to eight functional categories, all with evident and documented association with the phenotypes of interest. Also, we analyze which functional categories of virulence genes were more distinctive for pathogenicity in each taxonomic group, which seems to be a completely new kind of information and could lead to important evolutionary conclusions.
format Online
Article
Text
id pubmed-3412846
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-34128462012-08-22 Reduced Set of Virulence Genes Allows High Accuracy Prediction of Bacterial Pathogenicity in Humans Iraola, Gregorio Vazquez, Gustavo Spangenberg, Lucía Naya, Hugo PLoS One Research Article Although there have been great advances in understanding bacterial pathogenesis, there is still a lack of integrative information about what makes a bacterium a human pathogen. The advent of high-throughput sequencing technologies has dramatically increased the amount of completed bacterial genomes, for both known human pathogenic and non-pathogenic strains; this information is now available to investigate genetic features that determine pathogenic phenotypes in bacteria. In this work we determined presence/absence patterns of [Image: see text] different virulence-related genes among more than [Image: see text] finished bacterial genomes from both human pathogenic and non-pathogenic strains, belonging to different taxonomic groups (i.e: Actinobacteria, Gammaproteobacteria, Firmicutes, etc.). An accuracy of 95% using a cross-fold validation scheme with in-fold feature selection is obtained when classifying human pathogens and non-pathogens. A reduced subset of highly informative genes ([Image: see text]) is presented and applied to an external validation set. The statistical model was implemented in the BacFier v1.0 software (freely available at [Image: see text]), that displays not only the prediction (pathogen/non-pathogen) and an associated probability for pathogenicity, but also the presence/absence vector for the analyzed genes, so it is possible to decipher the subset of virulence genes responsible for the classification on the analyzed genome. Furthermore, we discuss the biological relevance for bacterial pathogenesis of the core set of genes, corresponding to eight functional categories, all with evident and documented association with the phenotypes of interest. Also, we analyze which functional categories of virulence genes were more distinctive for pathogenicity in each taxonomic group, which seems to be a completely new kind of information and could lead to important evolutionary conclusions. Public Library of Science 2012-08-06 /pmc/articles/PMC3412846/ /pubmed/22916122 http://dx.doi.org/10.1371/journal.pone.0042144 Text en © 2012 Iraola et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Iraola, Gregorio
Vazquez, Gustavo
Spangenberg, Lucía
Naya, Hugo
Reduced Set of Virulence Genes Allows High Accuracy Prediction of Bacterial Pathogenicity in Humans
title Reduced Set of Virulence Genes Allows High Accuracy Prediction of Bacterial Pathogenicity in Humans
title_full Reduced Set of Virulence Genes Allows High Accuracy Prediction of Bacterial Pathogenicity in Humans
title_fullStr Reduced Set of Virulence Genes Allows High Accuracy Prediction of Bacterial Pathogenicity in Humans
title_full_unstemmed Reduced Set of Virulence Genes Allows High Accuracy Prediction of Bacterial Pathogenicity in Humans
title_short Reduced Set of Virulence Genes Allows High Accuracy Prediction of Bacterial Pathogenicity in Humans
title_sort reduced set of virulence genes allows high accuracy prediction of bacterial pathogenicity in humans
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3412846/
https://www.ncbi.nlm.nih.gov/pubmed/22916122
http://dx.doi.org/10.1371/journal.pone.0042144
work_keys_str_mv AT iraolagregorio reducedsetofvirulencegenesallowshighaccuracypredictionofbacterialpathogenicityinhumans
AT vazquezgustavo reducedsetofvirulencegenesallowshighaccuracypredictionofbacterialpathogenicityinhumans
AT spangenberglucia reducedsetofvirulencegenesallowshighaccuracypredictionofbacterialpathogenicityinhumans
AT nayahugo reducedsetofvirulencegenesallowshighaccuracypredictionofbacterialpathogenicityinhumans