Cargando…

Stable bulged G-quadruplexes in the human genome: identification, experimental validation and functionalization

DNA sequence composition determines the topology and stability of G-quadruplexes (G4s). Bulged G-quadruplex structures (G4-Bs) are a subset of G4s characterized by 3D conformations with bulges. Current search algorithms fail to capture stable G4-B, making their genome-wide study infeasible. Here, we...

Descripción completa

Detalles Bibliográficos
Autores principales: Papp, Csaba, Mukundan, Vineeth T, Jenjaroenpun, Piroon, Winnerdy, Fernaldo Richtia, Ow, Ghim Siong, Phan, Anh Tuân, Kuznetsov, Vladimir A
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10201450/
https://www.ncbi.nlm.nih.gov/pubmed/37094040
http://dx.doi.org/10.1093/nar/gkad252
_version_ 1785045265893294080
author Papp, Csaba
Mukundan, Vineeth T
Jenjaroenpun, Piroon
Winnerdy, Fernaldo Richtia
Ow, Ghim Siong
Phan, Anh Tuân
Kuznetsov, Vladimir A
author_facet Papp, Csaba
Mukundan, Vineeth T
Jenjaroenpun, Piroon
Winnerdy, Fernaldo Richtia
Ow, Ghim Siong
Phan, Anh Tuân
Kuznetsov, Vladimir A
author_sort Papp, Csaba
collection PubMed
description DNA sequence composition determines the topology and stability of G-quadruplexes (G4s). Bulged G-quadruplex structures (G4-Bs) are a subset of G4s characterized by 3D conformations with bulges. Current search algorithms fail to capture stable G4-B, making their genome-wide study infeasible. Here, we introduced a large family of computationally defined and experimentally verified potential G4-B forming sequences (pG4-BS). We found 478 263 pG4-BS regions that do not overlap ‘canonical’ G4-forming sequences in the human genome and are preferentially localized in transcription regulatory regions including R-loops and open chromatin. Over 90% of protein-coding genes contain pG4-BS in their promoter or gene body. We observed generally higher pG4-BS content in R-loops and their flanks, longer genes that are associated with brain tissue, immune and developmental processes. Also, the presence of pG4-BS on both template and non-template strands in promoters is associated with oncogenesis, cardiovascular disease and stemness. Our G4-BS models predicted G4-forming ability in vitro with 91.5% accuracy. Analysis of G4-seq and CUT&Tag data strongly supports the existence of G4-BS conformations genome-wide. We reconstructed a novel G4-B 3D structure located in the E2F8 promoter. This study defines a large family of G4-like sequences, offering new insights into the essential biological functions and potential future therapeutic uses of G4-B.
format Online
Article
Text
id pubmed-10201450
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-102014502023-05-23 Stable bulged G-quadruplexes in the human genome: identification, experimental validation and functionalization Papp, Csaba Mukundan, Vineeth T Jenjaroenpun, Piroon Winnerdy, Fernaldo Richtia Ow, Ghim Siong Phan, Anh Tuân Kuznetsov, Vladimir A Nucleic Acids Res Computational Biology DNA sequence composition determines the topology and stability of G-quadruplexes (G4s). Bulged G-quadruplex structures (G4-Bs) are a subset of G4s characterized by 3D conformations with bulges. Current search algorithms fail to capture stable G4-B, making their genome-wide study infeasible. Here, we introduced a large family of computationally defined and experimentally verified potential G4-B forming sequences (pG4-BS). We found 478 263 pG4-BS regions that do not overlap ‘canonical’ G4-forming sequences in the human genome and are preferentially localized in transcription regulatory regions including R-loops and open chromatin. Over 90% of protein-coding genes contain pG4-BS in their promoter or gene body. We observed generally higher pG4-BS content in R-loops and their flanks, longer genes that are associated with brain tissue, immune and developmental processes. Also, the presence of pG4-BS on both template and non-template strands in promoters is associated with oncogenesis, cardiovascular disease and stemness. Our G4-BS models predicted G4-forming ability in vitro with 91.5% accuracy. Analysis of G4-seq and CUT&Tag data strongly supports the existence of G4-BS conformations genome-wide. We reconstructed a novel G4-B 3D structure located in the E2F8 promoter. This study defines a large family of G4-like sequences, offering new insights into the essential biological functions and potential future therapeutic uses of G4-B. Oxford University Press 2023-04-24 /pmc/articles/PMC10201450/ /pubmed/37094040 http://dx.doi.org/10.1093/nar/gkad252 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Computational Biology
Papp, Csaba
Mukundan, Vineeth T
Jenjaroenpun, Piroon
Winnerdy, Fernaldo Richtia
Ow, Ghim Siong
Phan, Anh Tuân
Kuznetsov, Vladimir A
Stable bulged G-quadruplexes in the human genome: identification, experimental validation and functionalization
title Stable bulged G-quadruplexes in the human genome: identification, experimental validation and functionalization
title_full Stable bulged G-quadruplexes in the human genome: identification, experimental validation and functionalization
title_fullStr Stable bulged G-quadruplexes in the human genome: identification, experimental validation and functionalization
title_full_unstemmed Stable bulged G-quadruplexes in the human genome: identification, experimental validation and functionalization
title_short Stable bulged G-quadruplexes in the human genome: identification, experimental validation and functionalization
title_sort stable bulged g-quadruplexes in the human genome: identification, experimental validation and functionalization
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10201450/
https://www.ncbi.nlm.nih.gov/pubmed/37094040
http://dx.doi.org/10.1093/nar/gkad252
work_keys_str_mv AT pappcsaba stablebulgedgquadruplexesinthehumangenomeidentificationexperimentalvalidationandfunctionalization
AT mukundanvineetht stablebulgedgquadruplexesinthehumangenomeidentificationexperimentalvalidationandfunctionalization
AT jenjaroenpunpiroon stablebulgedgquadruplexesinthehumangenomeidentificationexperimentalvalidationandfunctionalization
AT winnerdyfernaldorichtia stablebulgedgquadruplexesinthehumangenomeidentificationexperimentalvalidationandfunctionalization
AT owghimsiong stablebulgedgquadruplexesinthehumangenomeidentificationexperimentalvalidationandfunctionalization
AT phananhtuan stablebulgedgquadruplexesinthehumangenomeidentificationexperimentalvalidationandfunctionalization
AT kuznetsovvladimira stablebulgedgquadruplexesinthehumangenomeidentificationexperimentalvalidationandfunctionalization