Cargando…

Toward predictive R-loop computational biology: genome-scale prediction of R-loops reveals their association with complex promoter structures, G-quadruplexes and transcriptionally active enhancers

R-loops are three-stranded RNA:DNA hybrid structures essential for many normal and pathobiological processes. Previously, we generated a quantitative R-loop forming sequence (RLFS) model, quantitative model of R-loop-forming sequences (QmRLFS) and predicted ∼660 000 RLFSs; most of them located in ge...

Descripción completa

Detalles Bibliográficos
Autores principales: Kuznetsov, Vladimir A, Bondarenko, Vladyslav, Wongsurawat, Thidathip, Yenamandra, Surya P, Jenjaroenpun, Piroon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6125637/
https://www.ncbi.nlm.nih.gov/pubmed/29945198
http://dx.doi.org/10.1093/nar/gky554
_version_ 1783353196386713600
author Kuznetsov, Vladimir A
Bondarenko, Vladyslav
Wongsurawat, Thidathip
Yenamandra, Surya P
Jenjaroenpun, Piroon
author_facet Kuznetsov, Vladimir A
Bondarenko, Vladyslav
Wongsurawat, Thidathip
Yenamandra, Surya P
Jenjaroenpun, Piroon
author_sort Kuznetsov, Vladimir A
collection PubMed
description R-loops are three-stranded RNA:DNA hybrid structures essential for many normal and pathobiological processes. Previously, we generated a quantitative R-loop forming sequence (RLFS) model, quantitative model of R-loop-forming sequences (QmRLFS) and predicted ∼660 000 RLFSs; most of them located in genes and gene-flanking regions, G-rich regions and disease-associated genomic loci in the human genome. Here, we conducted a comprehensive comparative analysis of these RLFSs using experimental data and demonstrated the high performance of QmRLFS predictions on the nucleotide and genome scales. The preferential co-localization of RLFS with promoters, U1 splice sites, gene ends, enhancers and non-B DNA structures, such as G-quadruplexes, provides evidence for the mechanical linkage between DNA tertiary structures, transcription initiation and R-loops in critical regulatory genome regions. We introduced and characterized an abundant class of reverse-forward RLFS clusters highly enriched in non-B DNA structures, which localized to promoters, gene ends and enhancers. The RLFS co-localization with promoters and transcriptionally active enhancers suggested new models for in cis and in trans regulation by RNA:DNA hybrids of transcription initiation and formation of 3D-chromatin loops. Overall, this study provides a rationale for the discovery and characterization of the non-B DNA regulatory structures involved in the formation of the RNA:DNA interactome as the basis for an emerging quantitative R-loop biology and pathobiology.
format Online
Article
Text
id pubmed-6125637
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-61256372018-09-11 Toward predictive R-loop computational biology: genome-scale prediction of R-loops reveals their association with complex promoter structures, G-quadruplexes and transcriptionally active enhancers Kuznetsov, Vladimir A Bondarenko, Vladyslav Wongsurawat, Thidathip Yenamandra, Surya P Jenjaroenpun, Piroon Nucleic Acids Res Computational Biology R-loops are three-stranded RNA:DNA hybrid structures essential for many normal and pathobiological processes. Previously, we generated a quantitative R-loop forming sequence (RLFS) model, quantitative model of R-loop-forming sequences (QmRLFS) and predicted ∼660 000 RLFSs; most of them located in genes and gene-flanking regions, G-rich regions and disease-associated genomic loci in the human genome. Here, we conducted a comprehensive comparative analysis of these RLFSs using experimental data and demonstrated the high performance of QmRLFS predictions on the nucleotide and genome scales. The preferential co-localization of RLFS with promoters, U1 splice sites, gene ends, enhancers and non-B DNA structures, such as G-quadruplexes, provides evidence for the mechanical linkage between DNA tertiary structures, transcription initiation and R-loops in critical regulatory genome regions. We introduced and characterized an abundant class of reverse-forward RLFS clusters highly enriched in non-B DNA structures, which localized to promoters, gene ends and enhancers. The RLFS co-localization with promoters and transcriptionally active enhancers suggested new models for in cis and in trans regulation by RNA:DNA hybrids of transcription initiation and formation of 3D-chromatin loops. Overall, this study provides a rationale for the discovery and characterization of the non-B DNA regulatory structures involved in the formation of the RNA:DNA interactome as the basis for an emerging quantitative R-loop biology and pathobiology. Oxford University Press 2018-09-06 2018-06-26 /pmc/articles/PMC6125637/ /pubmed/29945198 http://dx.doi.org/10.1093/nar/gky554 Text en © The Author(s) 2018. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Computational Biology
Kuznetsov, Vladimir A
Bondarenko, Vladyslav
Wongsurawat, Thidathip
Yenamandra, Surya P
Jenjaroenpun, Piroon
Toward predictive R-loop computational biology: genome-scale prediction of R-loops reveals their association with complex promoter structures, G-quadruplexes and transcriptionally active enhancers
title Toward predictive R-loop computational biology: genome-scale prediction of R-loops reveals their association with complex promoter structures, G-quadruplexes and transcriptionally active enhancers
title_full Toward predictive R-loop computational biology: genome-scale prediction of R-loops reveals their association with complex promoter structures, G-quadruplexes and transcriptionally active enhancers
title_fullStr Toward predictive R-loop computational biology: genome-scale prediction of R-loops reveals their association with complex promoter structures, G-quadruplexes and transcriptionally active enhancers
title_full_unstemmed Toward predictive R-loop computational biology: genome-scale prediction of R-loops reveals their association with complex promoter structures, G-quadruplexes and transcriptionally active enhancers
title_short Toward predictive R-loop computational biology: genome-scale prediction of R-loops reveals their association with complex promoter structures, G-quadruplexes and transcriptionally active enhancers
title_sort toward predictive r-loop computational biology: genome-scale prediction of r-loops reveals their association with complex promoter structures, g-quadruplexes and transcriptionally active enhancers
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6125637/
https://www.ncbi.nlm.nih.gov/pubmed/29945198
http://dx.doi.org/10.1093/nar/gky554
work_keys_str_mv AT kuznetsovvladimira towardpredictiverloopcomputationalbiologygenomescalepredictionofrloopsrevealstheirassociationwithcomplexpromoterstructuresgquadruplexesandtranscriptionallyactiveenhancers
AT bondarenkovladyslav towardpredictiverloopcomputationalbiologygenomescalepredictionofrloopsrevealstheirassociationwithcomplexpromoterstructuresgquadruplexesandtranscriptionallyactiveenhancers
AT wongsurawatthidathip towardpredictiverloopcomputationalbiologygenomescalepredictionofrloopsrevealstheirassociationwithcomplexpromoterstructuresgquadruplexesandtranscriptionallyactiveenhancers
AT yenamandrasuryap towardpredictiverloopcomputationalbiologygenomescalepredictionofrloopsrevealstheirassociationwithcomplexpromoterstructuresgquadruplexesandtranscriptionallyactiveenhancers
AT jenjaroenpunpiroon towardpredictiverloopcomputationalbiologygenomescalepredictionofrloopsrevealstheirassociationwithcomplexpromoterstructuresgquadruplexesandtranscriptionallyactiveenhancers