
Variable Region Sequences Influence 16S rRNA Performance

16S rRNA gene sequences are commonly analyzed for taxonomic and phylogenetic studies because they contain variable regions that can help distinguish different genera. However, intra-genus distinction using variable region homology is often impossible due to the high overall sequence identities among...


Bibliographic Details
Main Authors: Bose, Nikhil, Moore, Sean D.
Format: Online Article Text
Language: English
Published: American Society for Microbiology 2023
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10269663/
https://www.ncbi.nlm.nih.gov/pubmed/37212673
http://dx.doi.org/10.1128/spectrum.01252-23
author Bose, Nikhil
Moore, Sean D.
collection PubMed
description 16S rRNA gene sequences are commonly analyzed for taxonomic and phylogenetic studies because they contain variable regions that can help distinguish different genera. However, intra-genus distinction using variable region homology is often impossible due to the high overall sequence identities among closely related species, even though some residues may be conserved within respective species. Using a computational method that included the allelic diversity within individual genomes, we discovered that certain Escherichia and Shigella species can be distinguished by a multi-allelic 16S rRNA variable region single nucleotide polymorphism (SNP). To evaluate the performance of 16S rRNAs with altered variable regions, we developed an in vivo system that measures the acceptance and distribution of variant 16S rRNAs into a large pool of natural versions supporting normal translation and growth. We found that 16S rRNAs containing evolutionarily disparate variable regions were underpopulated both in ribosomes and in active translation pools, even for an SNP. Overall, this study revealed that variable region sequences can substantially influence the performance of 16S rRNAs and that this biological constraint can be leveraged to justify refining taxonomic assignments of variable region sequence data.

IMPORTANCE This study reevaluates the notion that 16S rRNA gene variable region sequences are uninformative for intra-genus classification and that single nucleotide variations within them have no consequence to strains that bear them. We demonstrated that the performance of 16S rRNAs in Escherichia coli can be negatively impacted by sequence changes in variable regions, even for single nucleotide changes that are native to closely related Escherichia and Shigella species; thus, biological performance is likely constraining the evolution of variable regions in bacteria. Further, the native nucleotide variations we tested occur in all strains of their respective species and across their multiple 16S rRNA gene copies, suggesting that these species evolved beyond what would be discerned from a consensus sequence comparison. Therefore, this work also reveals that the multiple 16S rRNA gene alleles found in most bacteria can provide more informative phylogenetic and taxonomic detail than a single reference allele.
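
The abstract describes finding a SNP that is shared by all 16S rRNA gene alleles of one species and differs from the base carried by all alleles of another. The record does not give the authors' actual method, so the following is only a minimal sketch of that idea under stated assumptions: the alleles are already aligned to equal length, and the function name `fixed_differences`, the species labels, and the toy sequences are all hypothetical and invented for illustration.

```python
from typing import Dict, List


def fixed_differences(
    alleles_by_species: Dict[str, List[str]], a: str, b: str
) -> List[int]:
    """Return 0-based alignment columns where every allele of species `a`
    carries one base, every allele of species `b` carries one base, and
    those two bases differ (a multi-allelic, species-specific SNP)."""
    seqs_a = alleles_by_species[a]
    seqs_b = alleles_by_species[b]
    length = len(seqs_a[0])
    # Assumption: inputs are pre-aligned, so all alleles have equal length.
    if any(len(s) != length for s in seqs_a + seqs_b):
        raise ValueError("all alleles must be aligned to the same length")

    hits = []
    for col in range(length):
        bases_a = {s[col] for s in seqs_a}  # bases seen across a's alleles
        bases_b = {s[col] for s in seqs_b}  # bases seen across b's alleles
        # Keep the column only if each species is uniform and they disagree.
        if len(bases_a) == 1 and len(bases_b) == 1 and bases_a != bases_b:
            hits.append(col)
    return hits


# Toy, made-up aligned fragments (not real 16S rRNA sequences).
toy = {
    "species_A": ["ACGTACGT", "ACGTACGT", "ACGTACGA"],
    "species_B": ["ACGCACGT", "ACGCACGT", "ACGCACGT"],
}
print(fixed_differences(toy, "species_A", "species_B"))  # prints [3]
```

In the toy data, column 3 is reported because every allele of one species carries T and every allele of the other carries C. Column 7 is not reported because one species has mixed bases there, which is the kind of within-genome variation that a single reference allele would hide.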
format Online
Article
Text
id pubmed-10269663
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher American Society for Microbiology
record_format MEDLINE/PubMed
spelling pubmed-10269663 2023-06-16 Variable Region Sequences Influence 16S rRNA Performance Bose, Nikhil Moore, Sean D. Microbiol Spectr Research Article American Society for Microbiology 2023-05-22 /pmc/articles/PMC10269663/ /pubmed/37212673 http://dx.doi.org/10.1128/spectrum.01252-23 Text en Copyright © 2023 Bose and Moore. This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (https://creativecommons.org/licenses/by/4.0/).
title Variable Region Sequences Influence 16S rRNA Performance
topic Research Article