BSGatlas: a unified Bacillus subtilis genome and transcriptome annotation atlas with enhanced information access

A large part of our current understanding of gene regulation in Gram-positive bacteria is based on Bacillus subtilis, as it is one of the best-studied bacterial model systems. The rapid growth in data concerning its molecular and genomic biology is distributed across multiple annotation resour...

Bibliographic Details
Main authors: Geissler, Adrian Sven, Anthon, Christian, Alkan, Ferhat, González-Tortuero, Enrique, Poulsen, Line Dahl, Kallehauge, Thomas Beuchert, Breüner, Anne, Seemann, Stefan Ernst, Vinther, Jeppe, Gorodkin, Jan
Format: Online Article Text
Language: English
Published: Microbiology Society 2021
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8208703/
https://www.ncbi.nlm.nih.gov/pubmed/33539279
http://dx.doi.org/10.1099/mgen.0.000524
_version_ 1783708974981316608
author Geissler, Adrian Sven
Anthon, Christian
Alkan, Ferhat
González-Tortuero, Enrique
Poulsen, Line Dahl
Kallehauge, Thomas Beuchert
Breüner, Anne
Seemann, Stefan Ernst
Vinther, Jeppe
Gorodkin, Jan
collection PubMed
description A large part of our current understanding of gene regulation in Gram-positive bacteria is based on Bacillus subtilis, as it is one of the best-studied bacterial model systems. The rapidly growing body of data on its molecular and genomic biology is distributed across multiple annotation resources. Consequently, interpreting data from further B. subtilis experiments becomes increasingly challenging in both small- and large-scale analyses. Additionally, the annotation of structured RNA and non-coding RNA (ncRNA) in B. subtilis, as well as of its operon structure, still lags behind the annotation of the coding sequences. To address these challenges, we created the B. subtilis genome atlas, BSGatlas, which integrates and unifies multiple existing annotation resources. Compared to any of the individual resources, the BSGatlas contains twice as many ncRNAs, while improving the positional annotation for 70 % of the ncRNAs. Furthermore, we combined known transcription start and termination sites with lists of known co-transcribed gene sets to create a comprehensive transcript map. The combination with transcription start/termination site annotations resulted in 717 new sets of co-transcribed genes and 5335 untranslated regions (UTRs). In comparison to existing resources, the number of 5′ and 3′ UTRs increased nearly fivefold, and the number of internal UTRs doubled. The transcript map is organized in 2266 operons, which provide transcriptional annotation for 92 % of all genes in the genome, compared to at most 82 % in previous resources. We predicted an off-target-aware genome-wide library of CRISPR–Cas9 guide RNAs, which we also linked to polycistronic operons. We provide the BSGatlas in multiple forms: as a website (https://rth.dk/resources/bsgatlas/), an annotation hub for display in the UCSC genome browser, supplementary tables and standardized GFF3 format, which can be used in large-scale -omics studies. By complementing existing resources, the BSGatlas supports analyses of the B. subtilis genome and its molecular biology with respect to not only non-coding genes but also genome-wide transcriptional relationships of all genes.
format Online
Article
Text
id pubmed-8208703
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Microbiology Society
record_format MEDLINE/PubMed
spelling pubmed-8208703 2021-06-17 Microb Genom Research Article Microbiology Society 2021-02-04 /pmc/articles/PMC8208703/ /pubmed/33539279 http://dx.doi.org/10.1099/mgen.0.000524 Text en © 2021 The Authors https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License.
title BSGatlas: a unified Bacillus subtilis genome and transcriptome annotation atlas with enhanced information access
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8208703/
https://www.ncbi.nlm.nih.gov/pubmed/33539279
http://dx.doi.org/10.1099/mgen.0.000524