Cargando…

Improving Phylogenies Based on Average Nucleotide Identity, Incorporating Saturation Correction and Nonparametric Bootstrap Support

Whole-genome comparisons based on average nucleotide identities (ANI) and the genome-to-genome distance calculator have risen to prominence in rapidly classifying prokaryotic taxa using whole-genome sequences. Some implementations have even been proposed as a new standard in species classification a...

Descripción completa

Detalles Bibliográficos
Autores principales: Gosselin, Sophia, Fullmer, Matthew S, Feng, Yutian, Gogarten, Johann Peter
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8830074/
https://www.ncbi.nlm.nih.gov/pubmed/34289044
http://dx.doi.org/10.1093/sysbio/syab060
_version_ 1784648203625299968
author Gosselin, Sophia
Fullmer, Matthew S
Feng, Yutian
Gogarten, Johann Peter
author_facet Gosselin, Sophia
Fullmer, Matthew S
Feng, Yutian
Gogarten, Johann Peter
author_sort Gosselin, Sophia
collection PubMed
description Whole-genome comparisons based on average nucleotide identities (ANI) and the genome-to-genome distance calculator have risen to prominence in rapidly classifying prokaryotic taxa using whole-genome sequences. Some implementations have even been proposed as a new standard in species classification and have become a common technique for papers describing newly sequenced genomes. However, attempts to apply whole-genome divergence data to the delineation of higher taxonomic units and to phylogenetic inference have had difficulty matching those produced by more complex phylogenetic methods. We present a novel method for generating statistically supported phylogenies of archaeal and bacterial groups using a combined ANI and alignment fraction-based metric. For the test cases to which we applied the developed approach, we obtained results comparable with other methodologies up to at least the family level. The developed method uses nonparametric bootstrapping to gauge support for inferred groups. This method offers the opportunity to make use of whole-genome comparison data, that is already being generated, to quickly produce phylogenies including support for inferred groups. Additionally, the developed ANI methodology can assist the classification of higher taxonomic groups.[Average nucleotide identity (ANI); genome evolution; prokaryotic species delineation; taxonomy.]
format Online
Article
Text
id pubmed-8830074
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-88300742022-02-11 Improving Phylogenies Based on Average Nucleotide Identity, Incorporating Saturation Correction and Nonparametric Bootstrap Support Gosselin, Sophia Fullmer, Matthew S Feng, Yutian Gogarten, Johann Peter Syst Biol Regular Articles Whole-genome comparisons based on average nucleotide identities (ANI) and the genome-to-genome distance calculator have risen to prominence in rapidly classifying prokaryotic taxa using whole-genome sequences. Some implementations have even been proposed as a new standard in species classification and have become a common technique for papers describing newly sequenced genomes. However, attempts to apply whole-genome divergence data to the delineation of higher taxonomic units and to phylogenetic inference have had difficulty matching those produced by more complex phylogenetic methods. We present a novel method for generating statistically supported phylogenies of archaeal and bacterial groups using a combined ANI and alignment fraction-based metric. For the test cases to which we applied the developed approach, we obtained results comparable with other methodologies up to at least the family level. The developed method uses nonparametric bootstrapping to gauge support for inferred groups. This method offers the opportunity to make use of whole-genome comparison data, that is already being generated, to quickly produce phylogenies including support for inferred groups. Additionally, the developed ANI methodology can assist the classification of higher taxonomic groups.[Average nucleotide identity (ANI); genome evolution; prokaryotic species delineation; taxonomy.] Oxford University Press 2021-07-21 /pmc/articles/PMC8830074/ /pubmed/34289044 http://dx.doi.org/10.1093/sysbio/syab060 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of the Society of Systematic Biologists. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Regular Articles
Gosselin, Sophia
Fullmer, Matthew S
Feng, Yutian
Gogarten, Johann Peter
Improving Phylogenies Based on Average Nucleotide Identity, Incorporating Saturation Correction and Nonparametric Bootstrap Support
title Improving Phylogenies Based on Average Nucleotide Identity, Incorporating Saturation Correction and Nonparametric Bootstrap Support
title_full Improving Phylogenies Based on Average Nucleotide Identity, Incorporating Saturation Correction and Nonparametric Bootstrap Support
title_fullStr Improving Phylogenies Based on Average Nucleotide Identity, Incorporating Saturation Correction and Nonparametric Bootstrap Support
title_full_unstemmed Improving Phylogenies Based on Average Nucleotide Identity, Incorporating Saturation Correction and Nonparametric Bootstrap Support
title_short Improving Phylogenies Based on Average Nucleotide Identity, Incorporating Saturation Correction and Nonparametric Bootstrap Support
title_sort improving phylogenies based on average nucleotide identity, incorporating saturation correction and nonparametric bootstrap support
topic Regular Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8830074/
https://www.ncbi.nlm.nih.gov/pubmed/34289044
http://dx.doi.org/10.1093/sysbio/syab060
work_keys_str_mv AT gosselinsophia improvingphylogeniesbasedonaveragenucleotideidentityincorporatingsaturationcorrectionandnonparametricbootstrapsupport
AT fullmermatthews improvingphylogeniesbasedonaveragenucleotideidentityincorporatingsaturationcorrectionandnonparametricbootstrapsupport
AT fengyutian improvingphylogeniesbasedonaveragenucleotideidentityincorporatingsaturationcorrectionandnonparametricbootstrapsupport
AT gogartenjohannpeter improvingphylogeniesbasedonaveragenucleotideidentityincorporatingsaturationcorrectionandnonparametricbootstrapsupport