Accurate detection of RNA stem-loops in structurome data reveals widespread association with protein binding sites

RNA molecules are known to fold into specific structures which often play a central role in their functions and regulation. In silico folding of RNA transcripts, especially when assisted with structure profiling (SP) data, is capable of accurately elucidating relevant structural conformations. Howev...

Bibliographic Details
Main Authors: Radecki, Pierce; Uppuluri, Rahul; Deshpande, Kaustubh; Aviran, Sharon
Format: Online Article Text
Language: English
Published: Taylor & Francis 2021
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8677038/
https://www.ncbi.nlm.nih.gov/pubmed/34606413
http://dx.doi.org/10.1080/15476286.2021.1971382
_version_ 1784616058165919744
author Radecki, Pierce
Uppuluri, Rahul
Deshpande, Kaustubh
Aviran, Sharon
author_sort Radecki, Pierce
collection PubMed
description RNA molecules are known to fold into specific structures which often play a central role in their functions and regulation. In silico folding of RNA transcripts, especially when assisted with structure profiling (SP) data, is capable of accurately elucidating relevant structural conformations. However, such methods scale poorly to the swaths of SP data generated by transcriptome-wide experiments, which are becoming more commonplace and advancing our understanding of RNA structure and its regulation at global and local levels. This has created a need for tools capable of rapidly deriving structural assessments from SP data in a scalable manner. One such tool we previously introduced that aims to process such data is patteRNA, a statistical learning algorithm capable of rapidly mining big SP datasets for structural elements. Here, we present a reformulation of patteRNA’s pattern recognition scheme that sees significantly improved precision without major compromises to computational overhead. Specifically, we developed a data-driven logistic classifier which interprets patteRNA’s statistical characterizations of SP data in addition to local sequence properties as measured with a nearest neighbour thermodynamic model. Application of the classifier to human structurome data reveals a marked association between detected stem-loops and RNA binding protein (RBP) footprints. The results of our application demonstrate that upwards of 30% of RBP footprints occur within loops of stable stem-loop elements. Overall, our work arrives at a rapid and accurate method for automatically detecting families of RNA structure motifs and demonstrates the functional relevance of identifying them transcriptome-wide.
format Online
Article
Text
id pubmed-8677038
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Taylor & Francis
record_format MEDLINE/PubMed
spelling pubmed-8677038 2022-02-07 Accurate detection of RNA stem-loops in structurome data reveals widespread association with protein binding sites Radecki, Pierce Uppuluri, Rahul Deshpande, Kaustubh Aviran, Sharon RNA Biol Research Paper
Taylor & Francis 2021-10-04 /pmc/articles/PMC8677038/ /pubmed/34606413 http://dx.doi.org/10.1080/15476286.2021.1971382 Text en © 2021 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group. This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives License (https://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited, and is not altered, transformed, or built upon in any way.
title Accurate detection of RNA stem-loops in structurome data reveals widespread association with protein binding sites
topic Research Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8677038/
https://www.ncbi.nlm.nih.gov/pubmed/34606413
http://dx.doi.org/10.1080/15476286.2021.1971382