Cargando…
Structure of the space of taboo-free sequences
Models of sequence evolution typically assume that all sequences are possible. However, restriction enzymes that cut DNA at specific recognition sites provide an example where carrying a recognition site can be lethal. Motivated by this observation, we studied the set of strings over a finite alphab...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Berlin Heidelberg
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7560954/ https://www.ncbi.nlm.nih.gov/pubmed/32940748 http://dx.doi.org/10.1007/s00285-020-01535-5 |
_version_ | 1783595185737826304 |
---|---|
author | Manuel, Cassius von Haeseler, Arndt |
author_facet | Manuel, Cassius von Haeseler, Arndt |
author_sort | Manuel, Cassius |
collection | PubMed |
description | Models of sequence evolution typically assume that all sequences are possible. However, restriction enzymes that cut DNA at specific recognition sites provide an example where carrying a recognition site can be lethal. Motivated by this observation, we studied the set of strings over a finite alphabet with taboos, that is, with prohibited substrings. The taboo-set is referred to as [Formula: see text] and any allowed string as a taboo-free string. We consider the so-called Hamming graph [Formula: see text] , whose vertices are taboo-free strings of length n and whose edges connect two taboo-free strings if their Hamming distance equals one. Any (random) walk on this graph describes the evolution of a DNA sequence that avoids taboos. We describe the construction of the vertex set of [Formula: see text] . Then we state conditions under which [Formula: see text] and its suffix subgraphs are connected. Moreover, we provide an algorithm that determines if all these graphs are connected for an arbitrary [Formula: see text] . As an application of the algorithm, we show that about [Formula: see text] of bacteria listed in REBASE have a taboo-set that induces connected taboo-free Hamming graphs, because they have less than four type II restriction enzymes. On the other hand, four properly chosen taboos are enough to disconnect one suffix subgraph, and consequently connectivity of taboo-free Hamming graphs could change depending on the composition of restriction sites. |
format | Online Article Text |
id | pubmed-7560954 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Springer Berlin Heidelberg |
record_format | MEDLINE/PubMed |
spelling | pubmed-75609542020-10-19 Structure of the space of taboo-free sequences Manuel, Cassius von Haeseler, Arndt J Math Biol Article Models of sequence evolution typically assume that all sequences are possible. However, restriction enzymes that cut DNA at specific recognition sites provide an example where carrying a recognition site can be lethal. Motivated by this observation, we studied the set of strings over a finite alphabet with taboos, that is, with prohibited substrings. The taboo-set is referred to as [Formula: see text] and any allowed string as a taboo-free string. We consider the so-called Hamming graph [Formula: see text] , whose vertices are taboo-free strings of length n and whose edges connect two taboo-free strings if their Hamming distance equals one. Any (random) walk on this graph describes the evolution of a DNA sequence that avoids taboos. We describe the construction of the vertex set of [Formula: see text] . Then we state conditions under which [Formula: see text] and its suffix subgraphs are connected. Moreover, we provide an algorithm that determines if all these graphs are connected for an arbitrary [Formula: see text] . As an application of the algorithm, we show that about [Formula: see text] of bacteria listed in REBASE have a taboo-set that induces connected taboo-free Hamming graphs, because they have less than four type II restriction enzymes. On the other hand, four properly chosen taboos are enough to disconnect one suffix subgraph, and consequently connectivity of taboo-free Hamming graphs could change depending on the composition of restriction sites. Springer Berlin Heidelberg 2020-09-17 2020 /pmc/articles/PMC7560954/ /pubmed/32940748 http://dx.doi.org/10.1007/s00285-020-01535-5 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Manuel, Cassius von Haeseler, Arndt Structure of the space of taboo-free sequences |
title | Structure of the space of taboo-free sequences |
title_full | Structure of the space of taboo-free sequences |
title_fullStr | Structure of the space of taboo-free sequences |
title_full_unstemmed | Structure of the space of taboo-free sequences |
title_short | Structure of the space of taboo-free sequences |
title_sort | structure of the space of taboo-free sequences |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7560954/ https://www.ncbi.nlm.nih.gov/pubmed/32940748 http://dx.doi.org/10.1007/s00285-020-01535-5 |
work_keys_str_mv | AT manuelcassius structureofthespaceoftaboofreesequences AT vonhaeselerarndt structureofthespaceoftaboofreesequences |