Cargando…

Phylogenetic trees of closely related bacterial species and subspecies based on frequencies of short nucleotide sequences

Bacterial phylogenetic analyses are commonly performed to explore the evolutionary relationships among various bacterial species and genera based on their 16S rRNA gene sequences; however, these results are limited by mosaicism, intragenomic heterogeneity, and difficulties in distinguishing between...

Descripción completa

Detalles Bibliográficos
Autores principales: Nakano, Yoshio, Domon, Yusaku, Yamagishi, Kenji
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10118083/
https://www.ncbi.nlm.nih.gov/pubmed/37079522
http://dx.doi.org/10.1371/journal.pone.0268847
_version_ 1785028730197901312
author Nakano, Yoshio
Domon, Yusaku
Yamagishi, Kenji
author_facet Nakano, Yoshio
Domon, Yusaku
Yamagishi, Kenji
author_sort Nakano, Yoshio
collection PubMed
description Bacterial phylogenetic analyses are commonly performed to explore the evolutionary relationships among various bacterial species and genera based on their 16S rRNA gene sequences; however, these results are limited by mosaicism, intragenomic heterogeneity, and difficulties in distinguishing between related species. In this study, we aimed to perform genome-wide comparisons of different bacterial species, namely Escherichia coli, Shigella, Yersinia, Klebsiella, and Neisseria spp., based on their K-mer profiles to construct phylogenetic trees. Pentanucleotide frequency analyses (512 patterns of 5 nucleotides each) were performed to distinguish between highly similar species. Moreover, Escherichia albertii strains were clearly distinguished from E. coli and Shigella, despite being closely related to enterohemorrhagic E. coli in the phylogenetic tree. In addition, our phylogenetic tree of Ipomoea species based on pentamer frequency in chloroplast genomes was correlated with previously reported morphological similarities. Furthermore, a support vector machine clearly classified E. coli and Shigella genomes based on their pentanucleotide profiles. These results suggest that phylogenetic analyses based on penta- or hexamer profiles are a useful methodology for microbial phylogenetic studies. In addition, we introduced an R application, Phy5, which generates a phylogenetic tree based on genome-wide comparisons of pentamer profiles. The online version of Phy5 can be accessed at https://phy5.shinyapps.io/Phy5R/ and its command line version Phy5cli can be downloaded at https://github.com/YoshioNakano2021/phy5.
format Online
Article
Text
id pubmed-10118083
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-101180832023-04-21 Phylogenetic trees of closely related bacterial species and subspecies based on frequencies of short nucleotide sequences Nakano, Yoshio Domon, Yusaku Yamagishi, Kenji PLoS One Research Article Bacterial phylogenetic analyses are commonly performed to explore the evolutionary relationships among various bacterial species and genera based on their 16S rRNA gene sequences; however, these results are limited by mosaicism, intragenomic heterogeneity, and difficulties in distinguishing between related species. In this study, we aimed to perform genome-wide comparisons of different bacterial species, namely Escherichia coli, Shigella, Yersinia, Klebsiella, and Neisseria spp., based on their K-mer profiles to construct phylogenetic trees. Pentanucleotide frequency analyses (512 patterns of 5 nucleotides each) were performed to distinguish between highly similar species. Moreover, Escherichia albertii strains were clearly distinguished from E. coli and Shigella, despite being closely related to enterohemorrhagic E. coli in the phylogenetic tree. In addition, our phylogenetic tree of Ipomoea species based on pentamer frequency in chloroplast genomes was correlated with previously reported morphological similarities. Furthermore, a support vector machine clearly classified E. coli and Shigella genomes based on their pentanucleotide profiles. These results suggest that phylogenetic analyses based on penta- or hexamer profiles are a useful methodology for microbial phylogenetic studies. In addition, we introduced an R application, Phy5, which generates a phylogenetic tree based on genome-wide comparisons of pentamer profiles. The online version of Phy5 can be accessed at https://phy5.shinyapps.io/Phy5R/ and its command line version Phy5cli can be downloaded at https://github.com/YoshioNakano2021/phy5. Public Library of Science 2023-04-20 /pmc/articles/PMC10118083/ /pubmed/37079522 http://dx.doi.org/10.1371/journal.pone.0268847 Text en © 2023 Nakano et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Nakano, Yoshio
Domon, Yusaku
Yamagishi, Kenji
Phylogenetic trees of closely related bacterial species and subspecies based on frequencies of short nucleotide sequences
title Phylogenetic trees of closely related bacterial species and subspecies based on frequencies of short nucleotide sequences
title_full Phylogenetic trees of closely related bacterial species and subspecies based on frequencies of short nucleotide sequences
title_fullStr Phylogenetic trees of closely related bacterial species and subspecies based on frequencies of short nucleotide sequences
title_full_unstemmed Phylogenetic trees of closely related bacterial species and subspecies based on frequencies of short nucleotide sequences
title_short Phylogenetic trees of closely related bacterial species and subspecies based on frequencies of short nucleotide sequences
title_sort phylogenetic trees of closely related bacterial species and subspecies based on frequencies of short nucleotide sequences
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10118083/
https://www.ncbi.nlm.nih.gov/pubmed/37079522
http://dx.doi.org/10.1371/journal.pone.0268847
work_keys_str_mv AT nakanoyoshio phylogenetictreesofcloselyrelatedbacterialspeciesandsubspeciesbasedonfrequenciesofshortnucleotidesequences
AT domonyusaku phylogenetictreesofcloselyrelatedbacterialspeciesandsubspeciesbasedonfrequenciesofshortnucleotidesequences
AT yamagishikenji phylogenetictreesofcloselyrelatedbacterialspeciesandsubspeciesbasedonfrequenciesofshortnucleotidesequences