Cargando…

Prevalence of quadruplexes in the human genome

Guanine-rich DNA sequences of a particular form have the ability to fold into four-stranded structures called G-quadruplexes. In this paper, we present a working rule to predict which primary sequences can form this structure, and describe a search algorithm to identify such sequences in genomic DNA...

Descripción completa

Detalles Bibliográficos
Autores principales: Huppert, Julian L., Balasubramanian, Shankar
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1140081/
https://www.ncbi.nlm.nih.gov/pubmed/15914667
http://dx.doi.org/10.1093/nar/gki609
_version_ 1782124243088048128
author Huppert, Julian L.
Balasubramanian, Shankar
author_facet Huppert, Julian L.
Balasubramanian, Shankar
author_sort Huppert, Julian L.
collection PubMed
description Guanine-rich DNA sequences of a particular form have the ability to fold into four-stranded structures called G-quadruplexes. In this paper, we present a working rule to predict which primary sequences can form this structure, and describe a search algorithm to identify such sequences in genomic DNA. We count the number of quadruplexes found in the human genome and compare that with the figure predicted by modelling DNA as a Bernoulli stream or as a Markov chain, using windows of various sizes. We demonstrate that the distribution of loop lengths is significantly different from what would be expected in a random case, providing an indication of the number of potentially relevant quadruplex-forming sequences. In particular, we show that there is a significant repression of quadruplexes in the coding strand of exonic regions, which suggests that quadruplex-forming patterns are disfavoured in sequences that will form RNA.
format Text
id pubmed-1140081
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-11400812005-05-25 Prevalence of quadruplexes in the human genome Huppert, Julian L. Balasubramanian, Shankar Nucleic Acids Res Article Guanine-rich DNA sequences of a particular form have the ability to fold into four-stranded structures called G-quadruplexes. In this paper, we present a working rule to predict which primary sequences can form this structure, and describe a search algorithm to identify such sequences in genomic DNA. We count the number of quadruplexes found in the human genome and compare that with the figure predicted by modelling DNA as a Bernoulli stream or as a Markov chain, using windows of various sizes. We demonstrate that the distribution of loop lengths is significantly different from what would be expected in a random case, providing an indication of the number of potentially relevant quadruplex-forming sequences. In particular, we show that there is a significant repression of quadruplexes in the coding strand of exonic regions, which suggests that quadruplex-forming patterns are disfavoured in sequences that will form RNA. Oxford University Press 2005 2005-05-24 /pmc/articles/PMC1140081/ /pubmed/15914667 http://dx.doi.org/10.1093/nar/gki609 Text en © The Author 2005. Published by Oxford University Press. All rights reserved
spellingShingle Article
Huppert, Julian L.
Balasubramanian, Shankar
Prevalence of quadruplexes in the human genome
title Prevalence of quadruplexes in the human genome
title_full Prevalence of quadruplexes in the human genome
title_fullStr Prevalence of quadruplexes in the human genome
title_full_unstemmed Prevalence of quadruplexes in the human genome
title_short Prevalence of quadruplexes in the human genome
title_sort prevalence of quadruplexes in the human genome
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1140081/
https://www.ncbi.nlm.nih.gov/pubmed/15914667
http://dx.doi.org/10.1093/nar/gki609
work_keys_str_mv AT huppertjulianl prevalenceofquadruplexesinthehumangenome
AT balasubramanianshankar prevalenceofquadruplexesinthehumangenome