Cargando…

Quality-controlled R-loop meta-analysis reveals the characteristics of R-loop consensus regions

R-loops are three-stranded nucleic acid structures formed from the hybridization of RNA and DNA. While the pathological consequences of R-loops have been well-studied to date, the locations, classes, and dynamics of physiological R-loops remain poorly understood. R-loop mapping studies provide insig...

Descripción completa

Detalles Bibliográficos
Autores principales: Miller, Henry E, Montemayor, Daniel, Abdul, Jebriel, Vines, Anna, Levy, Simon A, Hartono, Stella R, Sharma, Kumar, Frost, Bess, Chédin, Frédéric, Bishop, Alexander J R
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9303298/
https://www.ncbi.nlm.nih.gov/pubmed/35758606
http://dx.doi.org/10.1093/nar/gkac537
_version_ 1784751829144305664
author Miller, Henry E
Montemayor, Daniel
Abdul, Jebriel
Vines, Anna
Levy, Simon A
Hartono, Stella R
Sharma, Kumar
Frost, Bess
Chédin, Frédéric
Bishop, Alexander J R
author_facet Miller, Henry E
Montemayor, Daniel
Abdul, Jebriel
Vines, Anna
Levy, Simon A
Hartono, Stella R
Sharma, Kumar
Frost, Bess
Chédin, Frédéric
Bishop, Alexander J R
author_sort Miller, Henry E
collection PubMed
description R-loops are three-stranded nucleic acid structures formed from the hybridization of RNA and DNA. While the pathological consequences of R-loops have been well-studied to date, the locations, classes, and dynamics of physiological R-loops remain poorly understood. R-loop mapping studies provide insight into R-loop dynamics, but their findings are challenging to generalize. This is due to the narrow biological scope of individual studies, the limitations of each mapping modality, and, in some cases, poor data quality. In this study, we reprocessed 810 R-loop mapping datasets from a wide array of biological conditions and mapping modalities. From this data resource, we developed an accurate R-loop data quality control method, and we reveal the extent of poor-quality data within previously published studies. We then identified a set of high-confidence R-loop mapping samples and used them to define consensus R-loop sites called ‘R-loop regions’ (RL regions). In the process, we identified a stark divergence between RL regions detected by S9.6 and dRNH-based mapping methods, particularly with respect to R-loop size, location, and colocalization with RNA binding factors. Taken together, this work provides a much-needed method to assess R-loop data quality and offers novel context regarding the differences between dRNH- and S9.6-based R-loop mapping approaches.
format Online
Article
Text
id pubmed-9303298
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-93032982022-07-22 Quality-controlled R-loop meta-analysis reveals the characteristics of R-loop consensus regions Miller, Henry E Montemayor, Daniel Abdul, Jebriel Vines, Anna Levy, Simon A Hartono, Stella R Sharma, Kumar Frost, Bess Chédin, Frédéric Bishop, Alexander J R Nucleic Acids Res Computational Biology R-loops are three-stranded nucleic acid structures formed from the hybridization of RNA and DNA. While the pathological consequences of R-loops have been well-studied to date, the locations, classes, and dynamics of physiological R-loops remain poorly understood. R-loop mapping studies provide insight into R-loop dynamics, but their findings are challenging to generalize. This is due to the narrow biological scope of individual studies, the limitations of each mapping modality, and, in some cases, poor data quality. In this study, we reprocessed 810 R-loop mapping datasets from a wide array of biological conditions and mapping modalities. From this data resource, we developed an accurate R-loop data quality control method, and we reveal the extent of poor-quality data within previously published studies. We then identified a set of high-confidence R-loop mapping samples and used them to define consensus R-loop sites called ‘R-loop regions’ (RL regions). In the process, we identified a stark divergence between RL regions detected by S9.6 and dRNH-based mapping methods, particularly with respect to R-loop size, location, and colocalization with RNA binding factors. Taken together, this work provides a much-needed method to assess R-loop data quality and offers novel context regarding the differences between dRNH- and S9.6-based R-loop mapping approaches. Oxford University Press 2022-06-27 /pmc/articles/PMC9303298/ /pubmed/35758606 http://dx.doi.org/10.1093/nar/gkac537 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Computational Biology
Miller, Henry E
Montemayor, Daniel
Abdul, Jebriel
Vines, Anna
Levy, Simon A
Hartono, Stella R
Sharma, Kumar
Frost, Bess
Chédin, Frédéric
Bishop, Alexander J R
Quality-controlled R-loop meta-analysis reveals the characteristics of R-loop consensus regions
title Quality-controlled R-loop meta-analysis reveals the characteristics of R-loop consensus regions
title_full Quality-controlled R-loop meta-analysis reveals the characteristics of R-loop consensus regions
title_fullStr Quality-controlled R-loop meta-analysis reveals the characteristics of R-loop consensus regions
title_full_unstemmed Quality-controlled R-loop meta-analysis reveals the characteristics of R-loop consensus regions
title_short Quality-controlled R-loop meta-analysis reveals the characteristics of R-loop consensus regions
title_sort quality-controlled r-loop meta-analysis reveals the characteristics of r-loop consensus regions
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9303298/
https://www.ncbi.nlm.nih.gov/pubmed/35758606
http://dx.doi.org/10.1093/nar/gkac537
work_keys_str_mv AT millerhenrye qualitycontrolledrloopmetaanalysisrevealsthecharacteristicsofrloopconsensusregions
AT montemayordaniel qualitycontrolledrloopmetaanalysisrevealsthecharacteristicsofrloopconsensusregions
AT abduljebriel qualitycontrolledrloopmetaanalysisrevealsthecharacteristicsofrloopconsensusregions
AT vinesanna qualitycontrolledrloopmetaanalysisrevealsthecharacteristicsofrloopconsensusregions
AT levysimona qualitycontrolledrloopmetaanalysisrevealsthecharacteristicsofrloopconsensusregions
AT hartonostellar qualitycontrolledrloopmetaanalysisrevealsthecharacteristicsofrloopconsensusregions
AT sharmakumar qualitycontrolledrloopmetaanalysisrevealsthecharacteristicsofrloopconsensusregions
AT frostbess qualitycontrolledrloopmetaanalysisrevealsthecharacteristicsofrloopconsensusregions
AT chedinfrederic qualitycontrolledrloopmetaanalysisrevealsthecharacteristicsofrloopconsensusregions
AT bishopalexanderjr qualitycontrolledrloopmetaanalysisrevealsthecharacteristicsofrloopconsensusregions