Cargando…

Eukaryotic Genomes Show Strong Evolutionary Conservation of k-mer Composition and Correlation Contributions between Introns and Intergenic Regions

Several strongly conserved DNA sequence patterns in and between introns and intergenic regions (IIRs) consisting of short tandem repeats (STRs) with repeat lengths <3 bp have already been described in the kingdom of Animalia. In this work, we expanded the search and analysis of conserved DNA sequ...

Descripción completa

Detalles Bibliográficos
Autores principales: Sievers, Aaron, Sauer, Liane, Hausmann, Michael, Hildenbrand, Georg
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8536142/
https://www.ncbi.nlm.nih.gov/pubmed/34680967
http://dx.doi.org/10.3390/genes12101571
_version_ 1784587952885596160
author Sievers, Aaron
Sauer, Liane
Hausmann, Michael
Hildenbrand, Georg
author_facet Sievers, Aaron
Sauer, Liane
Hausmann, Michael
Hildenbrand, Georg
author_sort Sievers, Aaron
collection PubMed
description Several strongly conserved DNA sequence patterns in and between introns and intergenic regions (IIRs) consisting of short tandem repeats (STRs) with repeat lengths <3 bp have already been described in the kingdom of Animalia. In this work, we expanded the search and analysis of conserved DNA sequence patterns to a wider range of eukaryotic genomes. Our aims were to confirm the conservation of these patterns, to support the hypothesis on their functional constraints and/or the identification of unknown patterns. We pairwise compared genomic DNA sequences of genes, exons, CDS, introns and intergenic regions of 34 Embryophyta (land plants), 30 Protista and 29 Fungi using established k-mer-based (alignment-free) comparison methods. Additionally, the results were compared with values derived for Animalia in former studies. We confirmed strong correlations between the sequence structures of IIRs spanning over the entire domain of Eukaryotes. We found that the high correlations within introns, intergenic regions and between the two are a result of conserved abundancies of STRs with repeat units ≤2 bp (e.g., (AT)n). For some sequence patterns and their inverse complementary sequences, we found a violation of equal distribution on complementary DNA strands in a subset of genomes. Looking at mismatches within the identified STR patterns, we found specific preferences for certain nucleotides stable over all four phylogenetic kingdoms. We conclude that all of these conserved patterns between IIRs indicate a shared function of these sequence structures related to STRs.
format Online
Article
Text
id pubmed-8536142
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-85361422021-10-23 Eukaryotic Genomes Show Strong Evolutionary Conservation of k-mer Composition and Correlation Contributions between Introns and Intergenic Regions Sievers, Aaron Sauer, Liane Hausmann, Michael Hildenbrand, Georg Genes (Basel) Article Several strongly conserved DNA sequence patterns in and between introns and intergenic regions (IIRs) consisting of short tandem repeats (STRs) with repeat lengths <3 bp have already been described in the kingdom of Animalia. In this work, we expanded the search and analysis of conserved DNA sequence patterns to a wider range of eukaryotic genomes. Our aims were to confirm the conservation of these patterns, to support the hypothesis on their functional constraints and/or the identification of unknown patterns. We pairwise compared genomic DNA sequences of genes, exons, CDS, introns and intergenic regions of 34 Embryophyta (land plants), 30 Protista and 29 Fungi using established k-mer-based (alignment-free) comparison methods. Additionally, the results were compared with values derived for Animalia in former studies. We confirmed strong correlations between the sequence structures of IIRs spanning over the entire domain of Eukaryotes. We found that the high correlations within introns, intergenic regions and between the two are a result of conserved abundancies of STRs with repeat units ≤2 bp (e.g., (AT)n). For some sequence patterns and their inverse complementary sequences, we found a violation of equal distribution on complementary DNA strands in a subset of genomes. Looking at mismatches within the identified STR patterns, we found specific preferences for certain nucleotides stable over all four phylogenetic kingdoms. We conclude that all of these conserved patterns between IIRs indicate a shared function of these sequence structures related to STRs. MDPI 2021-10-01 /pmc/articles/PMC8536142/ /pubmed/34680967 http://dx.doi.org/10.3390/genes12101571 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Sievers, Aaron
Sauer, Liane
Hausmann, Michael
Hildenbrand, Georg
Eukaryotic Genomes Show Strong Evolutionary Conservation of k-mer Composition and Correlation Contributions between Introns and Intergenic Regions
title Eukaryotic Genomes Show Strong Evolutionary Conservation of k-mer Composition and Correlation Contributions between Introns and Intergenic Regions
title_full Eukaryotic Genomes Show Strong Evolutionary Conservation of k-mer Composition and Correlation Contributions between Introns and Intergenic Regions
title_fullStr Eukaryotic Genomes Show Strong Evolutionary Conservation of k-mer Composition and Correlation Contributions between Introns and Intergenic Regions
title_full_unstemmed Eukaryotic Genomes Show Strong Evolutionary Conservation of k-mer Composition and Correlation Contributions between Introns and Intergenic Regions
title_short Eukaryotic Genomes Show Strong Evolutionary Conservation of k-mer Composition and Correlation Contributions between Introns and Intergenic Regions
title_sort eukaryotic genomes show strong evolutionary conservation of k-mer composition and correlation contributions between introns and intergenic regions
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8536142/
https://www.ncbi.nlm.nih.gov/pubmed/34680967
http://dx.doi.org/10.3390/genes12101571
work_keys_str_mv AT sieversaaron eukaryoticgenomesshowstrongevolutionaryconservationofkmercompositionandcorrelationcontributionsbetweenintronsandintergenicregions
AT sauerliane eukaryoticgenomesshowstrongevolutionaryconservationofkmercompositionandcorrelationcontributionsbetweenintronsandintergenicregions
AT hausmannmichael eukaryoticgenomesshowstrongevolutionaryconservationofkmercompositionandcorrelationcontributionsbetweenintronsandintergenicregions
AT hildenbrandgeorg eukaryoticgenomesshowstrongevolutionaryconservationofkmercompositionandcorrelationcontributionsbetweenintronsandintergenicregions