Cargando…

Mapping the sequences of potential guanine quadruplex motifs

The knowledge that potential guanine quadruplex sequences (PQs) are non-randomly distributed in relation to genomic features is now well established. However, this is for a general potential quadruplex motif which is characterized by short runs of guanine separated by loop regions, regardless of the...

Descripción completa

Detalles Bibliográficos
Autores principales: Todd, Alan K., Neidle, Stephen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3130275/
https://www.ncbi.nlm.nih.gov/pubmed/21357607
http://dx.doi.org/10.1093/nar/gkr104
_version_ 1782207599643459584
author Todd, Alan K.
Neidle, Stephen
author_facet Todd, Alan K.
Neidle, Stephen
author_sort Todd, Alan K.
collection PubMed
description The knowledge that potential guanine quadruplex sequences (PQs) are non-randomly distributed in relation to genomic features is now well established. However, this is for a general potential quadruplex motif which is characterized by short runs of guanine separated by loop regions, regardless of the nature of the loop sequence. There have been no studies to date which map the distribution of PQs in terms of primary sequence or which categorize PQs. To this end, we have generated clusters of PQ sequence groups of various sizes and various degrees of similarity for the non-template strand of introns in the human genome. We started with 86 697 sequences, and successively merged them into groups based on sequence similarity, carrying out 66 clustering cycles before convergence. We have demonstrated here that by using complete linkage hierarchical agglomerative clustering such PQ sequence categorization can be achieved. Our results give an insight into sequence diversity and categories of PQ sequences which occur in human intronic regions. We also highlight a number of clusters for which interesting relationships among their members were immediately evident and other clusters whose members seem unrelated, illustrating, we believe, a distinct role for different sequence types.
format Online
Article
Text
id pubmed-3130275
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-31302752011-07-06 Mapping the sequences of potential guanine quadruplex motifs Todd, Alan K. Neidle, Stephen Nucleic Acids Res Computational Biology The knowledge that potential guanine quadruplex sequences (PQs) are non-randomly distributed in relation to genomic features is now well established. However, this is for a general potential quadruplex motif which is characterized by short runs of guanine separated by loop regions, regardless of the nature of the loop sequence. There have been no studies to date which map the distribution of PQs in terms of primary sequence or which categorize PQs. To this end, we have generated clusters of PQ sequence groups of various sizes and various degrees of similarity for the non-template strand of introns in the human genome. We started with 86 697 sequences, and successively merged them into groups based on sequence similarity, carrying out 66 clustering cycles before convergence. We have demonstrated here that by using complete linkage hierarchical agglomerative clustering such PQ sequence categorization can be achieved. Our results give an insight into sequence diversity and categories of PQ sequences which occur in human intronic regions. We also highlight a number of clusters for which interesting relationships among their members were immediately evident and other clusters whose members seem unrelated, illustrating, we believe, a distinct role for different sequence types. Oxford University Press 2011-07 2011-02-26 /pmc/articles/PMC3130275/ /pubmed/21357607 http://dx.doi.org/10.1093/nar/gkr104 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Computational Biology
Todd, Alan K.
Neidle, Stephen
Mapping the sequences of potential guanine quadruplex motifs
title Mapping the sequences of potential guanine quadruplex motifs
title_full Mapping the sequences of potential guanine quadruplex motifs
title_fullStr Mapping the sequences of potential guanine quadruplex motifs
title_full_unstemmed Mapping the sequences of potential guanine quadruplex motifs
title_short Mapping the sequences of potential guanine quadruplex motifs
title_sort mapping the sequences of potential guanine quadruplex motifs
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3130275/
https://www.ncbi.nlm.nih.gov/pubmed/21357607
http://dx.doi.org/10.1093/nar/gkr104
work_keys_str_mv AT toddalank mappingthesequencesofpotentialguaninequadruplexmotifs
AT neidlestephen mappingthesequencesofpotentialguaninequadruplexmotifs