Cargando…

Expansion of the RNAStructuromeDB to include secondary structural data spanning the human protein-coding transcriptome

RNA plays vital functional roles in almost every component of biology, and these functional roles are often influenced by its folding into secondary and tertiary structures. An important role of RNA secondary structure is in maintaining proper gene regulation; therefore, making accurate predictions...

Descripción completa

Detalles Bibliográficos
Autores principales: Rouse, Warren B., O’Leary, Collin A., Booher, Nicholas J., Moss, Walter N.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9403969/
https://www.ncbi.nlm.nih.gov/pubmed/36008510
http://dx.doi.org/10.1038/s41598-022-18699-3
_version_ 1784773500921184256
author Rouse, Warren B.
O’Leary, Collin A.
Booher, Nicholas J.
Moss, Walter N.
author_facet Rouse, Warren B.
O’Leary, Collin A.
Booher, Nicholas J.
Moss, Walter N.
author_sort Rouse, Warren B.
collection PubMed
description RNA plays vital functional roles in almost every component of biology, and these functional roles are often influenced by its folding into secondary and tertiary structures. An important role of RNA secondary structure is in maintaining proper gene regulation; therefore, making accurate predictions of the structures involved in these processes is important. In this study, we have expanded on our previous work that led to the creation of the RNAStructuromeDB. Unlike this previous study that analyzed the human genome at low resolution, we have now scanned the protein-coding human transcriptome at high (single nt) resolution. This provides more robust structure predictions for over 100,000 isoforms of known protein-coding genes. Notably, we also utilize the motif identification tool, ScanFold, to model structures with high propensity for ordered/evolved stability. All data have been uploaded to the RNAStructuromeDB, allowing for easy searching of transcripts, visualization of data tracks (via the Integrative Genomics Viewer or IGV), and download of ScanFold data—including unique highly-ordered motifs. Herein, we provide an example analysis of MAT2A to demonstrate the utility of ScanFold at finding known and novel secondary structures, highlighting regions of potential functionality, and guiding generation of functional hypotheses through use of the data.
format Online
Article
Text
id pubmed-9403969
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-94039692022-08-25 Expansion of the RNAStructuromeDB to include secondary structural data spanning the human protein-coding transcriptome Rouse, Warren B. O’Leary, Collin A. Booher, Nicholas J. Moss, Walter N. Sci Rep Article RNA plays vital functional roles in almost every component of biology, and these functional roles are often influenced by its folding into secondary and tertiary structures. An important role of RNA secondary structure is in maintaining proper gene regulation; therefore, making accurate predictions of the structures involved in these processes is important. In this study, we have expanded on our previous work that led to the creation of the RNAStructuromeDB. Unlike this previous study that analyzed the human genome at low resolution, we have now scanned the protein-coding human transcriptome at high (single nt) resolution. This provides more robust structure predictions for over 100,000 isoforms of known protein-coding genes. Notably, we also utilize the motif identification tool, ScanFold, to model structures with high propensity for ordered/evolved stability. All data have been uploaded to the RNAStructuromeDB, allowing for easy searching of transcripts, visualization of data tracks (via the Integrative Genomics Viewer or IGV), and download of ScanFold data—including unique highly-ordered motifs. Herein, we provide an example analysis of MAT2A to demonstrate the utility of ScanFold at finding known and novel secondary structures, highlighting regions of potential functionality, and guiding generation of functional hypotheses through use of the data. Nature Publishing Group UK 2022-08-25 /pmc/articles/PMC9403969/ /pubmed/36008510 http://dx.doi.org/10.1038/s41598-022-18699-3 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Rouse, Warren B.
O’Leary, Collin A.
Booher, Nicholas J.
Moss, Walter N.
Expansion of the RNAStructuromeDB to include secondary structural data spanning the human protein-coding transcriptome
title Expansion of the RNAStructuromeDB to include secondary structural data spanning the human protein-coding transcriptome
title_full Expansion of the RNAStructuromeDB to include secondary structural data spanning the human protein-coding transcriptome
title_fullStr Expansion of the RNAStructuromeDB to include secondary structural data spanning the human protein-coding transcriptome
title_full_unstemmed Expansion of the RNAStructuromeDB to include secondary structural data spanning the human protein-coding transcriptome
title_short Expansion of the RNAStructuromeDB to include secondary structural data spanning the human protein-coding transcriptome
title_sort expansion of the rnastructuromedb to include secondary structural data spanning the human protein-coding transcriptome
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9403969/
https://www.ncbi.nlm.nih.gov/pubmed/36008510
http://dx.doi.org/10.1038/s41598-022-18699-3
work_keys_str_mv AT rousewarrenb expansionofthernastructuromedbtoincludesecondarystructuraldataspanningthehumanproteincodingtranscriptome
AT olearycollina expansionofthernastructuromedbtoincludesecondarystructuraldataspanningthehumanproteincodingtranscriptome
AT boohernicholasj expansionofthernastructuromedbtoincludesecondarystructuraldataspanningthehumanproteincodingtranscriptome
AT mosswaltern expansionofthernastructuromedbtoincludesecondarystructuraldataspanningthehumanproteincodingtranscriptome