Cargando…
Quality-controlled R-loop meta-analysis reveals the characteristics of R-loop consensus regions
R-loops are three-stranded nucleic acid structures formed from the hybridization of RNA and DNA. While the pathological consequences of R-loops have been well-studied to date, the locations, classes, and dynamics of physiological R-loops remain poorly understood. R-loop mapping studies provide insig...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9303298/ https://www.ncbi.nlm.nih.gov/pubmed/35758606 http://dx.doi.org/10.1093/nar/gkac537 |
_version_ | 1784751829144305664 |
---|---|
author | Miller, Henry E Montemayor, Daniel Abdul, Jebriel Vines, Anna Levy, Simon A Hartono, Stella R Sharma, Kumar Frost, Bess Chédin, Frédéric Bishop, Alexander J R |
author_facet | Miller, Henry E Montemayor, Daniel Abdul, Jebriel Vines, Anna Levy, Simon A Hartono, Stella R Sharma, Kumar Frost, Bess Chédin, Frédéric Bishop, Alexander J R |
author_sort | Miller, Henry E |
collection | PubMed |
description | R-loops are three-stranded nucleic acid structures formed from the hybridization of RNA and DNA. While the pathological consequences of R-loops have been well-studied to date, the locations, classes, and dynamics of physiological R-loops remain poorly understood. R-loop mapping studies provide insight into R-loop dynamics, but their findings are challenging to generalize. This is due to the narrow biological scope of individual studies, the limitations of each mapping modality, and, in some cases, poor data quality. In this study, we reprocessed 810 R-loop mapping datasets from a wide array of biological conditions and mapping modalities. From this data resource, we developed an accurate R-loop data quality control method, and we reveal the extent of poor-quality data within previously published studies. We then identified a set of high-confidence R-loop mapping samples and used them to define consensus R-loop sites called ‘R-loop regions’ (RL regions). In the process, we identified a stark divergence between RL regions detected by S9.6 and dRNH-based mapping methods, particularly with respect to R-loop size, location, and colocalization with RNA binding factors. Taken together, this work provides a much-needed method to assess R-loop data quality and offers novel context regarding the differences between dRNH- and S9.6-based R-loop mapping approaches. |
format | Online Article Text |
id | pubmed-9303298 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-93032982022-07-22 Quality-controlled R-loop meta-analysis reveals the characteristics of R-loop consensus regions Miller, Henry E Montemayor, Daniel Abdul, Jebriel Vines, Anna Levy, Simon A Hartono, Stella R Sharma, Kumar Frost, Bess Chédin, Frédéric Bishop, Alexander J R Nucleic Acids Res Computational Biology R-loops are three-stranded nucleic acid structures formed from the hybridization of RNA and DNA. While the pathological consequences of R-loops have been well-studied to date, the locations, classes, and dynamics of physiological R-loops remain poorly understood. R-loop mapping studies provide insight into R-loop dynamics, but their findings are challenging to generalize. This is due to the narrow biological scope of individual studies, the limitations of each mapping modality, and, in some cases, poor data quality. In this study, we reprocessed 810 R-loop mapping datasets from a wide array of biological conditions and mapping modalities. From this data resource, we developed an accurate R-loop data quality control method, and we reveal the extent of poor-quality data within previously published studies. We then identified a set of high-confidence R-loop mapping samples and used them to define consensus R-loop sites called ‘R-loop regions’ (RL regions). In the process, we identified a stark divergence between RL regions detected by S9.6 and dRNH-based mapping methods, particularly with respect to R-loop size, location, and colocalization with RNA binding factors. Taken together, this work provides a much-needed method to assess R-loop data quality and offers novel context regarding the differences between dRNH- and S9.6-based R-loop mapping approaches. Oxford University Press 2022-06-27 /pmc/articles/PMC9303298/ /pubmed/35758606 http://dx.doi.org/10.1093/nar/gkac537 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Computational Biology Miller, Henry E Montemayor, Daniel Abdul, Jebriel Vines, Anna Levy, Simon A Hartono, Stella R Sharma, Kumar Frost, Bess Chédin, Frédéric Bishop, Alexander J R Quality-controlled R-loop meta-analysis reveals the characteristics of R-loop consensus regions |
title | Quality-controlled R-loop meta-analysis reveals the characteristics of R-loop consensus regions |
title_full | Quality-controlled R-loop meta-analysis reveals the characteristics of R-loop consensus regions |
title_fullStr | Quality-controlled R-loop meta-analysis reveals the characteristics of R-loop consensus regions |
title_full_unstemmed | Quality-controlled R-loop meta-analysis reveals the characteristics of R-loop consensus regions |
title_short | Quality-controlled R-loop meta-analysis reveals the characteristics of R-loop consensus regions |
title_sort | quality-controlled r-loop meta-analysis reveals the characteristics of r-loop consensus regions |
topic | Computational Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9303298/ https://www.ncbi.nlm.nih.gov/pubmed/35758606 http://dx.doi.org/10.1093/nar/gkac537 |
work_keys_str_mv | AT millerhenrye qualitycontrolledrloopmetaanalysisrevealsthecharacteristicsofrloopconsensusregions AT montemayordaniel qualitycontrolledrloopmetaanalysisrevealsthecharacteristicsofrloopconsensusregions AT abduljebriel qualitycontrolledrloopmetaanalysisrevealsthecharacteristicsofrloopconsensusregions AT vinesanna qualitycontrolledrloopmetaanalysisrevealsthecharacteristicsofrloopconsensusregions AT levysimona qualitycontrolledrloopmetaanalysisrevealsthecharacteristicsofrloopconsensusregions AT hartonostellar qualitycontrolledrloopmetaanalysisrevealsthecharacteristicsofrloopconsensusregions AT sharmakumar qualitycontrolledrloopmetaanalysisrevealsthecharacteristicsofrloopconsensusregions AT frostbess qualitycontrolledrloopmetaanalysisrevealsthecharacteristicsofrloopconsensusregions AT chedinfrederic qualitycontrolledrloopmetaanalysisrevealsthecharacteristicsofrloopconsensusregions AT bishopalexanderjr qualitycontrolledrloopmetaanalysisrevealsthecharacteristicsofrloopconsensusregions |