SkewDB, a comprehensive database of GC and 10 other skews for over 30,000 chromosomes and plasmids

GC skew denotes the relative excess of G nucleotides over C nucleotides on the leading versus the lagging replication strand of eubacteria. While the effect is small, typically around 2.5%, it is robust and pervasive. GC skew and the analogous TA skew are a localized deviation from Chargaff’s second...

Bibliographic Details
Main Author: Hubert, Bert
Format: Online Article Text
Language: English
Published: Nature Publishing Group UK 2022
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8941118/
https://www.ncbi.nlm.nih.gov/pubmed/35318332
http://dx.doi.org/10.1038/s41597-022-01179-8
_version_ 1784673040891641856
author Hubert, Bert
author_facet Hubert, Bert
author_sort Hubert, Bert
collection PubMed
description GC skew denotes the relative excess of G nucleotides over C nucleotides on the leading versus the lagging replication strand of eubacteria. While the effect is small, typically around 2.5%, it is robust and pervasive. GC skew and the analogous TA skew are a localized deviation from Chargaff’s second parity rule, which states that G and C, and T and A occur with (mostly) equal frequency even within a strand. Different bacterial phyla show different kinds of skew, and differing relations between TA and GC skew. This article introduces an open access database (https://skewdb.org) of GC and 10 other skews for over 30,000 chromosomes and plasmids. Further details like codon bias, strand bias, strand lengths and taxonomic data are also included. The SkewDB can be used to generate or verify hypotheses. Since the origins of both the second parity rule and GC skew itself are not yet satisfactorily explained, such a database may enhance our understanding of prokaryotic DNA.
format Online
Article
Text
id pubmed-8941118
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-8941118 2022-04-08
SkewDB, a comprehensive database of GC and 10 other skews for over 30,000 chromosomes and plasmids
Hubert, Bert
Sci Data
Data Descriptor
GC skew denotes the relative excess of G nucleotides over C nucleotides on the leading versus the lagging replication strand of eubacteria. While the effect is small, typically around 2.5%, it is robust and pervasive. GC skew and the analogous TA skew are a localized deviation from Chargaff’s second parity rule, which states that G and C, and T and A occur with (mostly) equal frequency even within a strand. Different bacterial phyla show different kinds of skew, and differing relations between TA and GC skew. This article introduces an open access database (https://skewdb.org) of GC and 10 other skews for over 30,000 chromosomes and plasmids. Further details like codon bias, strand bias, strand lengths and taxonomic data are also included. The SkewDB can be used to generate or verify hypotheses. Since the origins of both the second parity rule and GC skew itself are not yet satisfactorily explained, such a database may enhance our understanding of prokaryotic DNA.
Nature Publishing Group UK 2022-03-22
/pmc/articles/PMC8941118/ /pubmed/35318332 http://dx.doi.org/10.1038/s41597-022-01179-8
Text en
© The Author(s) 2022
https://creativecommons.org/licenses/by/4.0/
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Data Descriptor
Hubert, Bert
SkewDB, a comprehensive database of GC and 10 other skews for over 30,000 chromosomes and plasmids
title SkewDB, a comprehensive database of GC and 10 other skews for over 30,000 chromosomes and plasmids
title_full SkewDB, a comprehensive database of GC and 10 other skews for over 30,000 chromosomes and plasmids
title_fullStr SkewDB, a comprehensive database of GC and 10 other skews for over 30,000 chromosomes and plasmids
title_full_unstemmed SkewDB, a comprehensive database of GC and 10 other skews for over 30,000 chromosomes and plasmids
title_short SkewDB, a comprehensive database of GC and 10 other skews for over 30,000 chromosomes and plasmids
title_sort skewdb, a comprehensive database of gc and 10 other skews for over 30,000 chromosomes and plasmids
topic Data Descriptor
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8941118/
https://www.ncbi.nlm.nih.gov/pubmed/35318332
http://dx.doi.org/10.1038/s41597-022-01179-8
work_keys_str_mv AT hubertbert skewdbacomprehensivedatabaseofgcand10otherskewsforover30000chromosomesandplasmids
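
The description in this record defines GC skew as the relative excess of G over C nucleotides on a strand. As a minimal sketch of that idea, the snippet below uses the commonly cited ratio (G - C) / (G + C) and a running +1/-1 sum along the sequence. The function names `gc_skew` and `cumulative_gc_skew` are hypothetical and chosen for illustration; this is not taken from SkewDB, and the database's own computation may differ.

```python
from itertools import accumulate


def gc_skew(seq: str) -> float:
    """Return (G - C) / (G + C) for one strand; 0.0 if it has no G or C."""
    s = seq.upper()
    g, c = s.count("G"), s.count("C")
    return (g - c) / (g + c) if (g + c) else 0.0


def cumulative_gc_skew(seq: str) -> list[int]:
    """Running sum along the sequence: +1 for each G, -1 for each C, 0 otherwise."""
    steps = ((base == "G") - (base == "C") for base in seq.upper())
    return list(accumulate(steps))
```

On a real chromosome the cumulative curve would be expected to rise along the G-rich stretch and fall along the C-rich one, so its turning points hint at where the strand composition changes; the sequences a reader feeds in here are their own, not data from the record.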