Cargando…

Gene content dissimilarity for subclassification of highly similar microbial strains

BACKGROUND: Identification and classification of highly similar microbial strains is a challenging issue in microbiology, ecology and evolutionary biology. Among various available approaches, gene content analysis is also at the core of microbial taxonomy. However, no threshold has been determined f...

Descripción completa

Detalles Bibliográficos
Autores principales: Tu, Qichao, Lin, Lu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4988056/
https://www.ncbi.nlm.nih.gov/pubmed/27530250
http://dx.doi.org/10.1186/s12864-016-2991-9
_version_ 1782448400313090048
author Tu, Qichao
Lin, Lu
author_facet Tu, Qichao
Lin, Lu
author_sort Tu, Qichao
collection PubMed
description BACKGROUND: Identification and classification of highly similar microbial strains is a challenging issue in microbiology, ecology and evolutionary biology. Among various available approaches, gene content analysis is also at the core of microbial taxonomy. However, no threshold has been determined for grouping microorgnisms to different taxonomic levels, and it is still not clear that to what extent genomic fluidity should occur to form a microbial taxonomic group. RESULTS: By taking advantage of the eggNOG database for orthologous groups, we calculated gene content dissimilarity among different microbial strains based on the orthologous gene profiles and tested the possibility of applying gene content dissimilarity as a quantitative index in classifying microbial taxonomic groups, as well as its potential application in subclassification of highly similar microbial strains. Evaluation of gene content dissimilarity to completed microbial genomes at different taxonomic levels suggested that cutoffs of 0.2 and 0.4 can be respectively used for species and family delineation, and that 0.2 gene content dissimilarity cutoff approximately corresponded to 98 % 16S rRNA gene identity and 94 % ANI for microbial species delineation. Furthermore, application of gene content dissimilarity to highly similar microbial strains suggested it as an effective approach in classifying closely related microorganisms into subgroups. CONCLUSIONS: This approach is especially useful in identifying pathogens from commensals in clinical microbiology. It also provides novel insights into how genomic fluidity is linked with microbial taxonomy. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2991-9) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4988056
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-49880562016-08-18 Gene content dissimilarity for subclassification of highly similar microbial strains Tu, Qichao Lin, Lu BMC Genomics Research Article BACKGROUND: Identification and classification of highly similar microbial strains is a challenging issue in microbiology, ecology and evolutionary biology. Among various available approaches, gene content analysis is also at the core of microbial taxonomy. However, no threshold has been determined for grouping microorgnisms to different taxonomic levels, and it is still not clear that to what extent genomic fluidity should occur to form a microbial taxonomic group. RESULTS: By taking advantage of the eggNOG database for orthologous groups, we calculated gene content dissimilarity among different microbial strains based on the orthologous gene profiles and tested the possibility of applying gene content dissimilarity as a quantitative index in classifying microbial taxonomic groups, as well as its potential application in subclassification of highly similar microbial strains. Evaluation of gene content dissimilarity to completed microbial genomes at different taxonomic levels suggested that cutoffs of 0.2 and 0.4 can be respectively used for species and family delineation, and that 0.2 gene content dissimilarity cutoff approximately corresponded to 98 % 16S rRNA gene identity and 94 % ANI for microbial species delineation. Furthermore, application of gene content dissimilarity to highly similar microbial strains suggested it as an effective approach in classifying closely related microorganisms into subgroups. CONCLUSIONS: This approach is especially useful in identifying pathogens from commensals in clinical microbiology. It also provides novel insights into how genomic fluidity is linked with microbial taxonomy. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2991-9) contains supplementary material, which is available to authorized users. BioMed Central 2016-08-17 /pmc/articles/PMC4988056/ /pubmed/27530250 http://dx.doi.org/10.1186/s12864-016-2991-9 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Tu, Qichao
Lin, Lu
Gene content dissimilarity for subclassification of highly similar microbial strains
title Gene content dissimilarity for subclassification of highly similar microbial strains
title_full Gene content dissimilarity for subclassification of highly similar microbial strains
title_fullStr Gene content dissimilarity for subclassification of highly similar microbial strains
title_full_unstemmed Gene content dissimilarity for subclassification of highly similar microbial strains
title_short Gene content dissimilarity for subclassification of highly similar microbial strains
title_sort gene content dissimilarity for subclassification of highly similar microbial strains
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4988056/
https://www.ncbi.nlm.nih.gov/pubmed/27530250
http://dx.doi.org/10.1186/s12864-016-2991-9
work_keys_str_mv AT tuqichao genecontentdissimilarityforsubclassificationofhighlysimilarmicrobialstrains
AT linlu genecontentdissimilarityforsubclassificationofhighlysimilarmicrobialstrains