Cargando…
QRNAstruct: a method for extracting secondary structural features of RNA via regression with biological activity
Recent technological advances have enabled the generation of large amounts of data consisting of RNA sequences and their functional activity. Here, we propose a method for extracting secondary structure features that affect the functional activity of RNA from sequence–activity data. Given pairs of R...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9303433/ https://www.ncbi.nlm.nih.gov/pubmed/35390152 http://dx.doi.org/10.1093/nar/gkac220 |
_version_ | 1784751863656087552 |
---|---|
author | Terai, Goro Asai, Kiyoshi |
author_facet | Terai, Goro Asai, Kiyoshi |
author_sort | Terai, Goro |
collection | PubMed |
description | Recent technological advances have enabled the generation of large amounts of data consisting of RNA sequences and their functional activity. Here, we propose a method for extracting secondary structure features that affect the functional activity of RNA from sequence–activity data. Given pairs of RNA sequences and their corresponding bioactivity values, our method calculates position-specific structural features of the input RNA sequences, considering every possible secondary structure of each RNA. A Ridge regression model is trained using the structural features as feature vectors and the bioactivity values as response variables. Optimized model parameters indicate how secondary structure features affect bioactivity. We used our method to extract intramolecular structural features of bacterial translation initiation sites and self-cleaving ribozymes, and the intermolecular features between rRNAs and Shine–Dalgarno sequences and between U1 RNAs and splicing sites. We not only identified known structural features but also revealed more detailed insights into structure–activity relationships than previously reported. Importantly, the datasets we analyzed here were obtained from different experimental systems and differed in size, sequence length and similarity, and number of RNA molecules involved, demonstrating that our method is applicable to various types of data consisting of RNA sequences and bioactivity values. |
format | Online Article Text |
id | pubmed-9303433 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-93034332022-07-22 QRNAstruct: a method for extracting secondary structural features of RNA via regression with biological activity Terai, Goro Asai, Kiyoshi Nucleic Acids Res Methods Online Recent technological advances have enabled the generation of large amounts of data consisting of RNA sequences and their functional activity. Here, we propose a method for extracting secondary structure features that affect the functional activity of RNA from sequence–activity data. Given pairs of RNA sequences and their corresponding bioactivity values, our method calculates position-specific structural features of the input RNA sequences, considering every possible secondary structure of each RNA. A Ridge regression model is trained using the structural features as feature vectors and the bioactivity values as response variables. Optimized model parameters indicate how secondary structure features affect bioactivity. We used our method to extract intramolecular structural features of bacterial translation initiation sites and self-cleaving ribozymes, and the intermolecular features between rRNAs and Shine–Dalgarno sequences and between U1 RNAs and splicing sites. We not only identified known structural features but also revealed more detailed insights into structure–activity relationships than previously reported. Importantly, the datasets we analyzed here were obtained from different experimental systems and differed in size, sequence length and similarity, and number of RNA molecules involved, demonstrating that our method is applicable to various types of data consisting of RNA sequences and bioactivity values. Oxford University Press 2022-04-07 /pmc/articles/PMC9303433/ /pubmed/35390152 http://dx.doi.org/10.1093/nar/gkac220 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Methods Online Terai, Goro Asai, Kiyoshi QRNAstruct: a method for extracting secondary structural features of RNA via regression with biological activity |
title | QRNAstruct: a method for extracting secondary structural features of RNA via regression with biological activity |
title_full | QRNAstruct: a method for extracting secondary structural features of RNA via regression with biological activity |
title_fullStr | QRNAstruct: a method for extracting secondary structural features of RNA via regression with biological activity |
title_full_unstemmed | QRNAstruct: a method for extracting secondary structural features of RNA via regression with biological activity |
title_short | QRNAstruct: a method for extracting secondary structural features of RNA via regression with biological activity |
title_sort | qrnastruct: a method for extracting secondary structural features of rna via regression with biological activity |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9303433/ https://www.ncbi.nlm.nih.gov/pubmed/35390152 http://dx.doi.org/10.1093/nar/gkac220 |
work_keys_str_mv | AT teraigoro qrnastructamethodforextractingsecondarystructuralfeaturesofrnaviaregressionwithbiologicalactivity AT asaikiyoshi qrnastructamethodforextractingsecondarystructuralfeaturesofrnaviaregressionwithbiologicalactivity |