Cargando…

QGRS-Conserve: a computational method for discovering evolutionarily conserved G-quadruplex motifs

BACKGROUND: Nucleic acids containing guanine tracts can form quadruplex structures via non-Watson-Crick base pairing. Formation of G-quadruplexes is associated with the regulation of important biological functions such as transcription, genetic instability, DNA repair, DNA replication, epigenetic me...

Descripción completa

Detalles Bibliográficos
Autores principales: Frees, Scott, Menendez, Camille, Crum, Matt, Bagga, Paramjeet S
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4017754/
https://www.ncbi.nlm.nih.gov/pubmed/24885782
http://dx.doi.org/10.1186/1479-7364-8-8
_version_ 1782480002848129024
author Frees, Scott
Menendez, Camille
Crum, Matt
Bagga, Paramjeet S
author_facet Frees, Scott
Menendez, Camille
Crum, Matt
Bagga, Paramjeet S
author_sort Frees, Scott
collection PubMed
description BACKGROUND: Nucleic acids containing guanine tracts can form quadruplex structures via non-Watson-Crick base pairing. Formation of G-quadruplexes is associated with the regulation of important biological functions such as transcription, genetic instability, DNA repair, DNA replication, epigenetic mechanisms, regulation of translation, and alternative splicing. G-quadruplexes play important roles in human diseases and are being considered as targets for a variety of therapies. Identification of functional G-quadruplexes and the study of their overall distribution in genomes and transcriptomes is an important pursuit. Traditional computational methods map sequence motifs capable of forming G-quadruplexes but have difficulty in distinguishing motifs that occur by chance from ones which fold into G-quadruplexes. RESULTS: We present Quadruplex forming ‘G’-rich sequences (QGRS)-Conserve, a computational method for calculating motif conservation across exomes and supports filtering to provide researchers with more precise methods of studying G-quadruplex distribution patterns. Our method quantitatively evaluates conservation between quadruplexes found in homologous nucleotide sequences based on several motif structural characteristics. QGRS-Conserve also efficiently manages overlapping G-quadruplex sequences such that the resulting datasets can be analyzed effectively. CONCLUSIONS: We have applied QGRS-Conserve to identify a large number of G-quadruplex motifs in the human exome conserved across several mammalian and non-mammalian species. We have successfully identified multiple homologs of many previously published G-quadruplexes that play post-transcriptional regulatory roles in human genes. Preliminary large-scale analysis identified many homologous G-quadruplexes in the 5′- and 3′-untranslated regions of mammalian species. An expectedly smaller set of G-quadruplex motifs was found to be conserved across larger phylogenetic distances. QGRS-Conserve provides means to build datasets that can be filtered and categorized in a variety of biological dimensions for more targeted studies in order to better understand the roles that G-quadruplexes play.
format Online
Article
Text
id pubmed-4017754
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40177542014-05-27 QGRS-Conserve: a computational method for discovering evolutionarily conserved G-quadruplex motifs Frees, Scott Menendez, Camille Crum, Matt Bagga, Paramjeet S Hum Genomics Primary Research BACKGROUND: Nucleic acids containing guanine tracts can form quadruplex structures via non-Watson-Crick base pairing. Formation of G-quadruplexes is associated with the regulation of important biological functions such as transcription, genetic instability, DNA repair, DNA replication, epigenetic mechanisms, regulation of translation, and alternative splicing. G-quadruplexes play important roles in human diseases and are being considered as targets for a variety of therapies. Identification of functional G-quadruplexes and the study of their overall distribution in genomes and transcriptomes is an important pursuit. Traditional computational methods map sequence motifs capable of forming G-quadruplexes but have difficulty in distinguishing motifs that occur by chance from ones which fold into G-quadruplexes. RESULTS: We present Quadruplex forming ‘G’-rich sequences (QGRS)-Conserve, a computational method for calculating motif conservation across exomes and supports filtering to provide researchers with more precise methods of studying G-quadruplex distribution patterns. Our method quantitatively evaluates conservation between quadruplexes found in homologous nucleotide sequences based on several motif structural characteristics. QGRS-Conserve also efficiently manages overlapping G-quadruplex sequences such that the resulting datasets can be analyzed effectively. CONCLUSIONS: We have applied QGRS-Conserve to identify a large number of G-quadruplex motifs in the human exome conserved across several mammalian and non-mammalian species. We have successfully identified multiple homologs of many previously published G-quadruplexes that play post-transcriptional regulatory roles in human genes. Preliminary large-scale analysis identified many homologous G-quadruplexes in the 5′- and 3′-untranslated regions of mammalian species. An expectedly smaller set of G-quadruplex motifs was found to be conserved across larger phylogenetic distances. QGRS-Conserve provides means to build datasets that can be filtered and categorized in a variety of biological dimensions for more targeted studies in order to better understand the roles that G-quadruplexes play. BioMed Central 2014-05-01 /pmc/articles/PMC4017754/ /pubmed/24885782 http://dx.doi.org/10.1186/1479-7364-8-8 Text en Copyright © 2014 Frees et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Primary Research
Frees, Scott
Menendez, Camille
Crum, Matt
Bagga, Paramjeet S
QGRS-Conserve: a computational method for discovering evolutionarily conserved G-quadruplex motifs
title QGRS-Conserve: a computational method for discovering evolutionarily conserved G-quadruplex motifs
title_full QGRS-Conserve: a computational method for discovering evolutionarily conserved G-quadruplex motifs
title_fullStr QGRS-Conserve: a computational method for discovering evolutionarily conserved G-quadruplex motifs
title_full_unstemmed QGRS-Conserve: a computational method for discovering evolutionarily conserved G-quadruplex motifs
title_short QGRS-Conserve: a computational method for discovering evolutionarily conserved G-quadruplex motifs
title_sort qgrs-conserve: a computational method for discovering evolutionarily conserved g-quadruplex motifs
topic Primary Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4017754/
https://www.ncbi.nlm.nih.gov/pubmed/24885782
http://dx.doi.org/10.1186/1479-7364-8-8
work_keys_str_mv AT freesscott qgrsconserveacomputationalmethodfordiscoveringevolutionarilyconservedgquadruplexmotifs
AT menendezcamille qgrsconserveacomputationalmethodfordiscoveringevolutionarilyconservedgquadruplexmotifs
AT crummatt qgrsconserveacomputationalmethodfordiscoveringevolutionarilyconservedgquadruplexmotifs
AT baggaparamjeets qgrsconserveacomputationalmethodfordiscoveringevolutionarilyconservedgquadruplexmotifs