Eukaryotic Genomes Show Strong Evolutionary Conservation of k-mer Composition and Correlation Contributions between Introns and Intergenic Regions
Several strongly conserved DNA sequence patterns in and between introns and intergenic regions (IIRs) consisting of short tandem repeats (STRs) with repeat lengths <3 bp have already been described in the kingdom of Animalia. In this work, we expanded the search and analysis of conserved DNA sequ...
Main authors: | Sievers, Aaron; Sauer, Liane; Hausmann, Michael; Hildenbrand, Georg |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | MDPI 2021 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8536142/ https://www.ncbi.nlm.nih.gov/pubmed/34680967 http://dx.doi.org/10.3390/genes12101571 |
_version_ | 1784587952885596160 |
---|---|
author | Sievers, Aaron Sauer, Liane Hausmann, Michael Hildenbrand, Georg |
author_facet | Sievers, Aaron Sauer, Liane Hausmann, Michael Hildenbrand, Georg |
author_sort | Sievers, Aaron |
collection | PubMed |
description | Several strongly conserved DNA sequence patterns in and between introns and intergenic regions (IIRs) consisting of short tandem repeats (STRs) with repeat lengths <3 bp have already been described in the kingdom of Animalia. In this work, we expanded the search and analysis of conserved DNA sequence patterns to a wider range of eukaryotic genomes. Our aims were to confirm the conservation of these patterns, to support the hypothesis on their functional constraints and/or the identification of unknown patterns. We pairwise compared genomic DNA sequences of genes, exons, CDS, introns and intergenic regions of 34 Embryophyta (land plants), 30 Protista and 29 Fungi using established k-mer-based (alignment-free) comparison methods. Additionally, the results were compared with values derived for Animalia in former studies. We confirmed strong correlations between the sequence structures of IIRs spanning over the entire domain of Eukaryotes. We found that the high correlations within introns, intergenic regions and between the two are a result of conserved abundancies of STRs with repeat units ≤2 bp (e.g., (AT)n). For some sequence patterns and their inverse complementary sequences, we found a violation of equal distribution on complementary DNA strands in a subset of genomes. Looking at mismatches within the identified STR patterns, we found specific preferences for certain nucleotides stable over all four phylogenetic kingdoms. We conclude that all of these conserved patterns between IIRs indicate a shared function of these sequence structures related to STRs. |
format | Online Article Text |
id | pubmed-8536142 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-85361422021-10-23 Eukaryotic Genomes Show Strong Evolutionary Conservation of k-mer Composition and Correlation Contributions between Introns and Intergenic Regions Sievers, Aaron Sauer, Liane Hausmann, Michael Hildenbrand, Georg Genes (Basel) Article Several strongly conserved DNA sequence patterns in and between introns and intergenic regions (IIRs) consisting of short tandem repeats (STRs) with repeat lengths <3 bp have already been described in the kingdom of Animalia. In this work, we expanded the search and analysis of conserved DNA sequence patterns to a wider range of eukaryotic genomes. Our aims were to confirm the conservation of these patterns, to support the hypothesis on their functional constraints and/or the identification of unknown patterns. We pairwise compared genomic DNA sequences of genes, exons, CDS, introns and intergenic regions of 34 Embryophyta (land plants), 30 Protista and 29 Fungi using established k-mer-based (alignment-free) comparison methods. Additionally, the results were compared with values derived for Animalia in former studies. We confirmed strong correlations between the sequence structures of IIRs spanning over the entire domain of Eukaryotes. We found that the high correlations within introns, intergenic regions and between the two are a result of conserved abundancies of STRs with repeat units ≤2 bp (e.g., (AT)n). For some sequence patterns and their inverse complementary sequences, we found a violation of equal distribution on complementary DNA strands in a subset of genomes. Looking at mismatches within the identified STR patterns, we found specific preferences for certain nucleotides stable over all four phylogenetic kingdoms. We conclude that all of these conserved patterns between IIRs indicate a shared function of these sequence structures related to STRs. MDPI 2021-10-01 /pmc/articles/PMC8536142/ /pubmed/34680967 http://dx.doi.org/10.3390/genes12101571 Text en © 2021 by the authors. 
https://creativecommons.org/licenses/by/4.0/ Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Sievers, Aaron Sauer, Liane Hausmann, Michael Hildenbrand, Georg Eukaryotic Genomes Show Strong Evolutionary Conservation of k-mer Composition and Correlation Contributions between Introns and Intergenic Regions |
title | Eukaryotic Genomes Show Strong Evolutionary Conservation of k-mer Composition and Correlation Contributions between Introns and Intergenic Regions |
title_full | Eukaryotic Genomes Show Strong Evolutionary Conservation of k-mer Composition and Correlation Contributions between Introns and Intergenic Regions |
title_fullStr | Eukaryotic Genomes Show Strong Evolutionary Conservation of k-mer Composition and Correlation Contributions between Introns and Intergenic Regions |
title_full_unstemmed | Eukaryotic Genomes Show Strong Evolutionary Conservation of k-mer Composition and Correlation Contributions between Introns and Intergenic Regions |
title_short | Eukaryotic Genomes Show Strong Evolutionary Conservation of k-mer Composition and Correlation Contributions between Introns and Intergenic Regions |
title_sort | eukaryotic genomes show strong evolutionary conservation of k-mer composition and correlation contributions between introns and intergenic regions |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8536142/ https://www.ncbi.nlm.nih.gov/pubmed/34680967 http://dx.doi.org/10.3390/genes12101571 |
work_keys_str_mv | AT sieversaaron eukaryoticgenomesshowstrongevolutionaryconservationofkmercompositionandcorrelationcontributionsbetweenintronsandintergenicregions AT sauerliane eukaryoticgenomesshowstrongevolutionaryconservationofkmercompositionandcorrelationcontributionsbetweenintronsandintergenicregions AT hausmannmichael eukaryoticgenomesshowstrongevolutionaryconservationofkmercompositionandcorrelationcontributionsbetweenintronsandintergenicregions AT hildenbrandgeorg eukaryoticgenomesshowstrongevolutionaryconservationofkmercompositionandcorrelationcontributionsbetweenintronsandintergenicregions |