Cargando…

Taxonomically Restricted Genes in Bacillus may Form Clusters of Homologs and Can be Traced to a Large Reservoir of Noncoding Sequences

Taxonomically restricted genes (TRGs) are unique for a defined group of organisms and may act as potential genetic determinants of lineage-specific, biological properties. Here, we explore the TRGs of highly diverse and economically important Bacillus bacteria by examining commonly used TRG identifi...

Descripción completa

Detalles Bibliográficos
Autores principales: Karlowski, Wojciech M, Varshney, Deepti, Zielezinski, Andrzej
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10003748/
https://www.ncbi.nlm.nih.gov/pubmed/36790099
http://dx.doi.org/10.1093/gbe/evad023
_version_ 1784904677655052288
author Karlowski, Wojciech M
Varshney, Deepti
Zielezinski, Andrzej
author_facet Karlowski, Wojciech M
Varshney, Deepti
Zielezinski, Andrzej
author_sort Karlowski, Wojciech M
collection PubMed
description Taxonomically restricted genes (TRGs) are unique for a defined group of organisms and may act as potential genetic determinants of lineage-specific, biological properties. Here, we explore the TRGs of highly diverse and economically important Bacillus bacteria by examining commonly used TRG identification parameters and data sources. We show the significant effects of sequence similarity thresholds, composition, and the size of the reference database in the identification process. Subsequently, we applied stringent TRG search parameters and expanded the identification procedure by incorporating an analysis of noncoding and non-syntenic regions of non-Bacillus genomes. A multiplex annotation procedure minimized the number of false-positive TRG predictions and showed nearly one-third of the alleged TRGs could be mapped to genes missed in genome annotations. We traced the putative origin of TRGs by identifying homologous, noncoding genomic regions in non-Bacillus species and detected sequence changes that could transform these regions into protein-coding genes. In addition, our analysis indicated that Bacillus TRGs represent a specific group of genes mostly showing intermediate sequence properties between genes that are conserved across multiple taxa and nonannotated peptides encoded by open reading frames.
format Online
Article
Text
id pubmed-10003748
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-100037482023-03-11 Taxonomically Restricted Genes in Bacillus may Form Clusters of Homologs and Can be Traced to a Large Reservoir of Noncoding Sequences Karlowski, Wojciech M Varshney, Deepti Zielezinski, Andrzej Genome Biol Evol Article Taxonomically restricted genes (TRGs) are unique for a defined group of organisms and may act as potential genetic determinants of lineage-specific, biological properties. Here, we explore the TRGs of highly diverse and economically important Bacillus bacteria by examining commonly used TRG identification parameters and data sources. We show the significant effects of sequence similarity thresholds, composition, and the size of the reference database in the identification process. Subsequently, we applied stringent TRG search parameters and expanded the identification procedure by incorporating an analysis of noncoding and non-syntenic regions of non-Bacillus genomes. A multiplex annotation procedure minimized the number of false-positive TRG predictions and showed nearly one-third of the alleged TRGs could be mapped to genes missed in genome annotations. We traced the putative origin of TRGs by identifying homologous, noncoding genomic regions in non-Bacillus species and detected sequence changes that could transform these regions into protein-coding genes. In addition, our analysis indicated that Bacillus TRGs represent a specific group of genes mostly showing intermediate sequence properties between genes that are conserved across multiple taxa and nonannotated peptides encoded by open reading frames. Oxford University Press 2023-02-15 /pmc/articles/PMC10003748/ /pubmed/36790099 http://dx.doi.org/10.1093/gbe/evad023 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Article
Karlowski, Wojciech M
Varshney, Deepti
Zielezinski, Andrzej
Taxonomically Restricted Genes in Bacillus may Form Clusters of Homologs and Can be Traced to a Large Reservoir of Noncoding Sequences
title Taxonomically Restricted Genes in Bacillus may Form Clusters of Homologs and Can be Traced to a Large Reservoir of Noncoding Sequences
title_full Taxonomically Restricted Genes in Bacillus may Form Clusters of Homologs and Can be Traced to a Large Reservoir of Noncoding Sequences
title_fullStr Taxonomically Restricted Genes in Bacillus may Form Clusters of Homologs and Can be Traced to a Large Reservoir of Noncoding Sequences
title_full_unstemmed Taxonomically Restricted Genes in Bacillus may Form Clusters of Homologs and Can be Traced to a Large Reservoir of Noncoding Sequences
title_short Taxonomically Restricted Genes in Bacillus may Form Clusters of Homologs and Can be Traced to a Large Reservoir of Noncoding Sequences
title_sort taxonomically restricted genes in bacillus may form clusters of homologs and can be traced to a large reservoir of noncoding sequences
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10003748/
https://www.ncbi.nlm.nih.gov/pubmed/36790099
http://dx.doi.org/10.1093/gbe/evad023
work_keys_str_mv AT karlowskiwojciechm taxonomicallyrestrictedgenesinbacillusmayformclustersofhomologsandcanbetracedtoalargereservoirofnoncodingsequences
AT varshneydeepti taxonomicallyrestrictedgenesinbacillusmayformclustersofhomologsandcanbetracedtoalargereservoirofnoncodingsequences
AT zielezinskiandrzej taxonomicallyrestrictedgenesinbacillusmayformclustersofhomologsandcanbetracedtoalargereservoirofnoncodingsequences