Cargando…
Microbial species delineation using whole genome sequences
Increased sequencing of microbial genomes has revealed that prevailing prokaryotic species assignments can be inconsistent with whole genome information for a significant number of species. The long-standing need for a systematic and scalable species assignment technique can be met by the genome-wid...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4538840/ https://www.ncbi.nlm.nih.gov/pubmed/26150420 http://dx.doi.org/10.1093/nar/gkv657 |
_version_ | 1782386044665069568 |
---|---|
author | Varghese, Neha J. Mukherjee, Supratim Ivanova, Natalia Konstantinidis, Konstantinos T. Mavrommatis, Kostas Kyrpides, Nikos C. Pati, Amrita |
author_facet | Varghese, Neha J. Mukherjee, Supratim Ivanova, Natalia Konstantinidis, Konstantinos T. Mavrommatis, Kostas Kyrpides, Nikos C. Pati, Amrita |
author_sort | Varghese, Neha J. |
collection | PubMed |
description | Increased sequencing of microbial genomes has revealed that prevailing prokaryotic species assignments can be inconsistent with whole genome information for a significant number of species. The long-standing need for a systematic and scalable species assignment technique can be met by the genome-wide Average Nucleotide Identity (gANI) metric, which is widely acknowledged as a robust measure of genomic relatedness. In this work, we demonstrate that the combination of gANI and the alignment fraction (AF) between two genomes accurately reflects their genomic relatedness. We introduce an efficient implementation of AF,gANI and discuss its successful application to 86.5M genome pairs between 13,151 prokaryotic genomes assigned to 3032 species. Subsequently, by comparing the genome clusters obtained from complete linkage clustering of these pairs to existing taxonomy, we observed that nearly 18% of all prokaryotic species suffer from anomalies in species definition. Our results can be used to explore central questions such as whether microorganisms form a continuum of genetic diversity or distinct species represented by distinct genetic signatures. We propose that this precise and objective AF,gANI-based species definition: the MiSI (Microbial Species Identifier) method, be used to address previous inconsistencies in species classification and as the primary guide for new taxonomic species assignment, supplemented by the traditional polyphasic approach, as required. |
format | Online Article Text |
id | pubmed-4538840 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-45388402015-08-18 Microbial species delineation using whole genome sequences Varghese, Neha J. Mukherjee, Supratim Ivanova, Natalia Konstantinidis, Konstantinos T. Mavrommatis, Kostas Kyrpides, Nikos C. Pati, Amrita Nucleic Acids Res Computational Biology Increased sequencing of microbial genomes has revealed that prevailing prokaryotic species assignments can be inconsistent with whole genome information for a significant number of species. The long-standing need for a systematic and scalable species assignment technique can be met by the genome-wide Average Nucleotide Identity (gANI) metric, which is widely acknowledged as a robust measure of genomic relatedness. In this work, we demonstrate that the combination of gANI and the alignment fraction (AF) between two genomes accurately reflects their genomic relatedness. We introduce an efficient implementation of AF,gANI and discuss its successful application to 86.5M genome pairs between 13,151 prokaryotic genomes assigned to 3032 species. Subsequently, by comparing the genome clusters obtained from complete linkage clustering of these pairs to existing taxonomy, we observed that nearly 18% of all prokaryotic species suffer from anomalies in species definition. Our results can be used to explore central questions such as whether microorganisms form a continuum of genetic diversity or distinct species represented by distinct genetic signatures. We propose that this precise and objective AF,gANI-based species definition: the MiSI (Microbial Species Identifier) method, be used to address previous inconsistencies in species classification and as the primary guide for new taxonomic species assignment, supplemented by the traditional polyphasic approach, as required. Oxford University Press 2015-08-18 2015-07-06 /pmc/articles/PMC4538840/ /pubmed/26150420 http://dx.doi.org/10.1093/nar/gkv657 Text en © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Computational Biology Varghese, Neha J. Mukherjee, Supratim Ivanova, Natalia Konstantinidis, Konstantinos T. Mavrommatis, Kostas Kyrpides, Nikos C. Pati, Amrita Microbial species delineation using whole genome sequences |
title | Microbial species delineation using whole genome sequences |
title_full | Microbial species delineation using whole genome sequences |
title_fullStr | Microbial species delineation using whole genome sequences |
title_full_unstemmed | Microbial species delineation using whole genome sequences |
title_short | Microbial species delineation using whole genome sequences |
title_sort | microbial species delineation using whole genome sequences |
topic | Computational Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4538840/ https://www.ncbi.nlm.nih.gov/pubmed/26150420 http://dx.doi.org/10.1093/nar/gkv657 |
work_keys_str_mv | AT varghesenehaj microbialspeciesdelineationusingwholegenomesequences AT mukherjeesupratim microbialspeciesdelineationusingwholegenomesequences AT ivanovanatalia microbialspeciesdelineationusingwholegenomesequences AT konstantinidiskonstantinost microbialspeciesdelineationusingwholegenomesequences AT mavrommatiskostas microbialspeciesdelineationusingwholegenomesequences AT kyrpidesnikosc microbialspeciesdelineationusingwholegenomesequences AT patiamrita microbialspeciesdelineationusingwholegenomesequences |