Cargando…

Microbial species delineation using whole genome sequences

Increased sequencing of microbial genomes has revealed that prevailing prokaryotic species assignments can be inconsistent with whole genome information for a significant number of species. The long-standing need for a systematic and scalable species assignment technique can be met by the genome-wid...

Descripción completa

Detalles Bibliográficos
Autores principales: Varghese, Neha J., Mukherjee, Supratim, Ivanova, Natalia, Konstantinidis, Konstantinos T., Mavrommatis, Kostas, Kyrpides, Nikos C., Pati, Amrita
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4538840/
https://www.ncbi.nlm.nih.gov/pubmed/26150420
http://dx.doi.org/10.1093/nar/gkv657
_version_ 1782386044665069568
author Varghese, Neha J.
Mukherjee, Supratim
Ivanova, Natalia
Konstantinidis, Konstantinos T.
Mavrommatis, Kostas
Kyrpides, Nikos C.
Pati, Amrita
author_facet Varghese, Neha J.
Mukherjee, Supratim
Ivanova, Natalia
Konstantinidis, Konstantinos T.
Mavrommatis, Kostas
Kyrpides, Nikos C.
Pati, Amrita
author_sort Varghese, Neha J.
collection PubMed
description Increased sequencing of microbial genomes has revealed that prevailing prokaryotic species assignments can be inconsistent with whole genome information for a significant number of species. The long-standing need for a systematic and scalable species assignment technique can be met by the genome-wide Average Nucleotide Identity (gANI) metric, which is widely acknowledged as a robust measure of genomic relatedness. In this work, we demonstrate that the combination of gANI and the alignment fraction (AF) between two genomes accurately reflects their genomic relatedness. We introduce an efficient implementation of AF,gANI and discuss its successful application to 86.5M genome pairs between 13,151 prokaryotic genomes assigned to 3032 species. Subsequently, by comparing the genome clusters obtained from complete linkage clustering of these pairs to existing taxonomy, we observed that nearly 18% of all prokaryotic species suffer from anomalies in species definition. Our results can be used to explore central questions such as whether microorganisms form a continuum of genetic diversity or distinct species represented by distinct genetic signatures. We propose that this precise and objective AF,gANI-based species definition: the MiSI (Microbial Species Identifier) method, be used to address previous inconsistencies in species classification and as the primary guide for new taxonomic species assignment, supplemented by the traditional polyphasic approach, as required.
format Online
Article
Text
id pubmed-4538840
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-45388402015-08-18 Microbial species delineation using whole genome sequences Varghese, Neha J. Mukherjee, Supratim Ivanova, Natalia Konstantinidis, Konstantinos T. Mavrommatis, Kostas Kyrpides, Nikos C. Pati, Amrita Nucleic Acids Res Computational Biology Increased sequencing of microbial genomes has revealed that prevailing prokaryotic species assignments can be inconsistent with whole genome information for a significant number of species. The long-standing need for a systematic and scalable species assignment technique can be met by the genome-wide Average Nucleotide Identity (gANI) metric, which is widely acknowledged as a robust measure of genomic relatedness. In this work, we demonstrate that the combination of gANI and the alignment fraction (AF) between two genomes accurately reflects their genomic relatedness. We introduce an efficient implementation of AF,gANI and discuss its successful application to 86.5M genome pairs between 13,151 prokaryotic genomes assigned to 3032 species. Subsequently, by comparing the genome clusters obtained from complete linkage clustering of these pairs to existing taxonomy, we observed that nearly 18% of all prokaryotic species suffer from anomalies in species definition. Our results can be used to explore central questions such as whether microorganisms form a continuum of genetic diversity or distinct species represented by distinct genetic signatures. We propose that this precise and objective AF,gANI-based species definition: the MiSI (Microbial Species Identifier) method, be used to address previous inconsistencies in species classification and as the primary guide for new taxonomic species assignment, supplemented by the traditional polyphasic approach, as required. Oxford University Press 2015-08-18 2015-07-06 /pmc/articles/PMC4538840/ /pubmed/26150420 http://dx.doi.org/10.1093/nar/gkv657 Text en © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Computational Biology
Varghese, Neha J.
Mukherjee, Supratim
Ivanova, Natalia
Konstantinidis, Konstantinos T.
Mavrommatis, Kostas
Kyrpides, Nikos C.
Pati, Amrita
Microbial species delineation using whole genome sequences
title Microbial species delineation using whole genome sequences
title_full Microbial species delineation using whole genome sequences
title_fullStr Microbial species delineation using whole genome sequences
title_full_unstemmed Microbial species delineation using whole genome sequences
title_short Microbial species delineation using whole genome sequences
title_sort microbial species delineation using whole genome sequences
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4538840/
https://www.ncbi.nlm.nih.gov/pubmed/26150420
http://dx.doi.org/10.1093/nar/gkv657
work_keys_str_mv AT varghesenehaj microbialspeciesdelineationusingwholegenomesequences
AT mukherjeesupratim microbialspeciesdelineationusingwholegenomesequences
AT ivanovanatalia microbialspeciesdelineationusingwholegenomesequences
AT konstantinidiskonstantinost microbialspeciesdelineationusingwholegenomesequences
AT mavrommatiskostas microbialspeciesdelineationusingwholegenomesequences
AT kyrpidesnikosc microbialspeciesdelineationusingwholegenomesequences
AT patiamrita microbialspeciesdelineationusingwholegenomesequences