Cargando…

QRNAstruct: a method for extracting secondary structural features of RNA via regression with biological activity

Recent technological advances have enabled the generation of large amounts of data consisting of RNA sequences and their functional activity. Here, we propose a method for extracting secondary structure features that affect the functional activity of RNA from sequence–activity data. Given pairs of R...

Descripción completa

Detalles Bibliográficos
Autores principales: Terai, Goro, Asai, Kiyoshi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9303433/
https://www.ncbi.nlm.nih.gov/pubmed/35390152
http://dx.doi.org/10.1093/nar/gkac220
_version_ 1784751863656087552
author Terai, Goro
Asai, Kiyoshi
author_facet Terai, Goro
Asai, Kiyoshi
author_sort Terai, Goro
collection PubMed
description Recent technological advances have enabled the generation of large amounts of data consisting of RNA sequences and their functional activity. Here, we propose a method for extracting secondary structure features that affect the functional activity of RNA from sequence–activity data. Given pairs of RNA sequences and their corresponding bioactivity values, our method calculates position-specific structural features of the input RNA sequences, considering every possible secondary structure of each RNA. A Ridge regression model is trained using the structural features as feature vectors and the bioactivity values as response variables. Optimized model parameters indicate how secondary structure features affect bioactivity. We used our method to extract intramolecular structural features of bacterial translation initiation sites and self-cleaving ribozymes, and the intermolecular features between rRNAs and Shine–Dalgarno sequences and between U1 RNAs and splicing sites. We not only identified known structural features but also revealed more detailed insights into structure–activity relationships than previously reported. Importantly, the datasets we analyzed here were obtained from different experimental systems and differed in size, sequence length and similarity, and number of RNA molecules involved, demonstrating that our method is applicable to various types of data consisting of RNA sequences and bioactivity values.
format Online
Article
Text
id pubmed-9303433
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-93034332022-07-22 QRNAstruct: a method for extracting secondary structural features of RNA via regression with biological activity Terai, Goro Asai, Kiyoshi Nucleic Acids Res Methods Online Recent technological advances have enabled the generation of large amounts of data consisting of RNA sequences and their functional activity. Here, we propose a method for extracting secondary structure features that affect the functional activity of RNA from sequence–activity data. Given pairs of RNA sequences and their corresponding bioactivity values, our method calculates position-specific structural features of the input RNA sequences, considering every possible secondary structure of each RNA. A Ridge regression model is trained using the structural features as feature vectors and the bioactivity values as response variables. Optimized model parameters indicate how secondary structure features affect bioactivity. We used our method to extract intramolecular structural features of bacterial translation initiation sites and self-cleaving ribozymes, and the intermolecular features between rRNAs and Shine–Dalgarno sequences and between U1 RNAs and splicing sites. We not only identified known structural features but also revealed more detailed insights into structure–activity relationships than previously reported. Importantly, the datasets we analyzed here were obtained from different experimental systems and differed in size, sequence length and similarity, and number of RNA molecules involved, demonstrating that our method is applicable to various types of data consisting of RNA sequences and bioactivity values. Oxford University Press 2022-04-07 /pmc/articles/PMC9303433/ /pubmed/35390152 http://dx.doi.org/10.1093/nar/gkac220 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Online
Terai, Goro
Asai, Kiyoshi
QRNAstruct: a method for extracting secondary structural features of RNA via regression with biological activity
title QRNAstruct: a method for extracting secondary structural features of RNA via regression with biological activity
title_full QRNAstruct: a method for extracting secondary structural features of RNA via regression with biological activity
title_fullStr QRNAstruct: a method for extracting secondary structural features of RNA via regression with biological activity
title_full_unstemmed QRNAstruct: a method for extracting secondary structural features of RNA via regression with biological activity
title_short QRNAstruct: a method for extracting secondary structural features of RNA via regression with biological activity
title_sort qrnastruct: a method for extracting secondary structural features of rna via regression with biological activity
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9303433/
https://www.ncbi.nlm.nih.gov/pubmed/35390152
http://dx.doi.org/10.1093/nar/gkac220
work_keys_str_mv AT teraigoro qrnastructamethodforextractingsecondarystructuralfeaturesofrnaviaregressionwithbiologicalactivity
AT asaikiyoshi qrnastructamethodforextractingsecondarystructuralfeaturesofrnaviaregressionwithbiologicalactivity