Cargando…
Taxonomically Restricted Genes in Bacillus may Form Clusters of Homologs and Can be Traced to a Large Reservoir of Noncoding Sequences
Taxonomically restricted genes (TRGs) are unique for a defined group of organisms and may act as potential genetic determinants of lineage-specific, biological properties. Here, we explore the TRGs of highly diverse and economically important Bacillus bacteria by examining commonly used TRG identifi...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10003748/ https://www.ncbi.nlm.nih.gov/pubmed/36790099 http://dx.doi.org/10.1093/gbe/evad023 |
_version_ | 1784904677655052288 |
---|---|
author | Karlowski, Wojciech M Varshney, Deepti Zielezinski, Andrzej |
author_facet | Karlowski, Wojciech M Varshney, Deepti Zielezinski, Andrzej |
author_sort | Karlowski, Wojciech M |
collection | PubMed |
description | Taxonomically restricted genes (TRGs) are unique for a defined group of organisms and may act as potential genetic determinants of lineage-specific, biological properties. Here, we explore the TRGs of highly diverse and economically important Bacillus bacteria by examining commonly used TRG identification parameters and data sources. We show the significant effects of sequence similarity thresholds, composition, and the size of the reference database in the identification process. Subsequently, we applied stringent TRG search parameters and expanded the identification procedure by incorporating an analysis of noncoding and non-syntenic regions of non-Bacillus genomes. A multiplex annotation procedure minimized the number of false-positive TRG predictions and showed nearly one-third of the alleged TRGs could be mapped to genes missed in genome annotations. We traced the putative origin of TRGs by identifying homologous, noncoding genomic regions in non-Bacillus species and detected sequence changes that could transform these regions into protein-coding genes. In addition, our analysis indicated that Bacillus TRGs represent a specific group of genes mostly showing intermediate sequence properties between genes that are conserved across multiple taxa and nonannotated peptides encoded by open reading frames. |
format | Online Article Text |
id | pubmed-10003748 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-100037482023-03-11 Taxonomically Restricted Genes in Bacillus may Form Clusters of Homologs and Can be Traced to a Large Reservoir of Noncoding Sequences Karlowski, Wojciech M Varshney, Deepti Zielezinski, Andrzej Genome Biol Evol Article Taxonomically restricted genes (TRGs) are unique for a defined group of organisms and may act as potential genetic determinants of lineage-specific, biological properties. Here, we explore the TRGs of highly diverse and economically important Bacillus bacteria by examining commonly used TRG identification parameters and data sources. We show the significant effects of sequence similarity thresholds, composition, and the size of the reference database in the identification process. Subsequently, we applied stringent TRG search parameters and expanded the identification procedure by incorporating an analysis of noncoding and non-syntenic regions of non-Bacillus genomes. A multiplex annotation procedure minimized the number of false-positive TRG predictions and showed nearly one-third of the alleged TRGs could be mapped to genes missed in genome annotations. We traced the putative origin of TRGs by identifying homologous, noncoding genomic regions in non-Bacillus species and detected sequence changes that could transform these regions into protein-coding genes. In addition, our analysis indicated that Bacillus TRGs represent a specific group of genes mostly showing intermediate sequence properties between genes that are conserved across multiple taxa and nonannotated peptides encoded by open reading frames. Oxford University Press 2023-02-15 /pmc/articles/PMC10003748/ /pubmed/36790099 http://dx.doi.org/10.1093/gbe/evad023 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Article Karlowski, Wojciech M Varshney, Deepti Zielezinski, Andrzej Taxonomically Restricted Genes in Bacillus may Form Clusters of Homologs and Can be Traced to a Large Reservoir of Noncoding Sequences |
title | Taxonomically Restricted Genes in Bacillus may Form Clusters of Homologs and Can be Traced to a Large Reservoir of Noncoding Sequences |
title_full | Taxonomically Restricted Genes in Bacillus may Form Clusters of Homologs and Can be Traced to a Large Reservoir of Noncoding Sequences |
title_fullStr | Taxonomically Restricted Genes in Bacillus may Form Clusters of Homologs and Can be Traced to a Large Reservoir of Noncoding Sequences |
title_full_unstemmed | Taxonomically Restricted Genes in Bacillus may Form Clusters of Homologs and Can be Traced to a Large Reservoir of Noncoding Sequences |
title_short | Taxonomically Restricted Genes in Bacillus may Form Clusters of Homologs and Can be Traced to a Large Reservoir of Noncoding Sequences |
title_sort | taxonomically restricted genes in bacillus may form clusters of homologs and can be traced to a large reservoir of noncoding sequences |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10003748/ https://www.ncbi.nlm.nih.gov/pubmed/36790099 http://dx.doi.org/10.1093/gbe/evad023 |
work_keys_str_mv | AT karlowskiwojciechm taxonomicallyrestrictedgenesinbacillusmayformclustersofhomologsandcanbetracedtoalargereservoirofnoncodingsequences AT varshneydeepti taxonomicallyrestrictedgenesinbacillusmayformclustersofhomologsandcanbetracedtoalargereservoirofnoncodingsequences AT zielezinskiandrzej taxonomicallyrestrictedgenesinbacillusmayformclustersofhomologsandcanbetracedtoalargereservoirofnoncodingsequences |