Cargando…

Structure of the space of taboo-free sequences

Models of sequence evolution typically assume that all sequences are possible. However, restriction enzymes that cut DNA at specific recognition sites provide an example where carrying a recognition site can be lethal. Motivated by this observation, we studied the set of strings over a finite alphab...

Descripción completa

Detalles Bibliográficos
Autores principales: Manuel, Cassius, von Haeseler, Arndt
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Berlin Heidelberg 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7560954/
https://www.ncbi.nlm.nih.gov/pubmed/32940748
http://dx.doi.org/10.1007/s00285-020-01535-5
_version_ 1783595185737826304
author Manuel, Cassius
von Haeseler, Arndt
author_facet Manuel, Cassius
von Haeseler, Arndt
author_sort Manuel, Cassius
collection PubMed
description Models of sequence evolution typically assume that all sequences are possible. However, restriction enzymes that cut DNA at specific recognition sites provide an example where carrying a recognition site can be lethal. Motivated by this observation, we studied the set of strings over a finite alphabet with taboos, that is, with prohibited substrings. The taboo-set is referred to as [Formula: see text] and any allowed string as a taboo-free string. We consider the so-called Hamming graph [Formula: see text] , whose vertices are taboo-free strings of length n and whose edges connect two taboo-free strings if their Hamming distance equals one. Any (random) walk on this graph describes the evolution of a DNA sequence that avoids taboos. We describe the construction of the vertex set of [Formula: see text] . Then we state conditions under which [Formula: see text] and its suffix subgraphs are connected. Moreover, we provide an algorithm that determines if all these graphs are connected for an arbitrary [Formula: see text] . As an application of the algorithm, we show that about [Formula: see text] of bacteria listed in REBASE have a taboo-set that induces connected taboo-free Hamming graphs, because they have less than four type II restriction enzymes. On the other hand, four properly chosen taboos are enough to disconnect one suffix subgraph, and consequently connectivity of taboo-free Hamming graphs could change depending on the composition of restriction sites.
format Online
Article
Text
id pubmed-7560954
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Springer Berlin Heidelberg
record_format MEDLINE/PubMed
spelling pubmed-75609542020-10-19 Structure of the space of taboo-free sequences Manuel, Cassius von Haeseler, Arndt J Math Biol Article Models of sequence evolution typically assume that all sequences are possible. However, restriction enzymes that cut DNA at specific recognition sites provide an example where carrying a recognition site can be lethal. Motivated by this observation, we studied the set of strings over a finite alphabet with taboos, that is, with prohibited substrings. The taboo-set is referred to as [Formula: see text] and any allowed string as a taboo-free string. We consider the so-called Hamming graph [Formula: see text] , whose vertices are taboo-free strings of length n and whose edges connect two taboo-free strings if their Hamming distance equals one. Any (random) walk on this graph describes the evolution of a DNA sequence that avoids taboos. We describe the construction of the vertex set of [Formula: see text] . Then we state conditions under which [Formula: see text] and its suffix subgraphs are connected. Moreover, we provide an algorithm that determines if all these graphs are connected for an arbitrary [Formula: see text] . As an application of the algorithm, we show that about [Formula: see text] of bacteria listed in REBASE have a taboo-set that induces connected taboo-free Hamming graphs, because they have less than four type II restriction enzymes. On the other hand, four properly chosen taboos are enough to disconnect one suffix subgraph, and consequently connectivity of taboo-free Hamming graphs could change depending on the composition of restriction sites. Springer Berlin Heidelberg 2020-09-17 2020 /pmc/articles/PMC7560954/ /pubmed/32940748 http://dx.doi.org/10.1007/s00285-020-01535-5 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Manuel, Cassius
von Haeseler, Arndt
Structure of the space of taboo-free sequences
title Structure of the space of taboo-free sequences
title_full Structure of the space of taboo-free sequences
title_fullStr Structure of the space of taboo-free sequences
title_full_unstemmed Structure of the space of taboo-free sequences
title_short Structure of the space of taboo-free sequences
title_sort structure of the space of taboo-free sequences
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7560954/
https://www.ncbi.nlm.nih.gov/pubmed/32940748
http://dx.doi.org/10.1007/s00285-020-01535-5
work_keys_str_mv AT manuelcassius structureofthespaceoftaboofreesequences
AT vonhaeselerarndt structureofthespaceoftaboofreesequences