Cargando…
KnotAli: informed energy minimization through the use of evolutionary information
BACKGROUND: Improving the prediction of structures, especially those containing pseudoknots (structures with crossing base pairs) is an ongoing challenge. Homology-based methods utilize structural similarities within a family to predict the structure. However, their prediction is limited to the cons...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9063079/ https://www.ncbi.nlm.nih.gov/pubmed/35505276 http://dx.doi.org/10.1186/s12859-022-04673-3 |
_version_ | 1784699091202080768 |
---|---|
author | Gray, Mateo Chester, Sean Jabbari, Hosna |
author_facet | Gray, Mateo Chester, Sean Jabbari, Hosna |
author_sort | Gray, Mateo |
collection | PubMed |
description | BACKGROUND: Improving the prediction of structures, especially those containing pseudoknots (structures with crossing base pairs) is an ongoing challenge. Homology-based methods utilize structural similarities within a family to predict the structure. However, their prediction is limited to the consensus structure, and by the quality of the alignment. Minimum free energy (MFE) based methods, on the other hand, do not rely on familial information and can predict structures of novel RNA molecules. Their prediction normally suffers from inaccuracies due to their underlying energy parameters. RESULTS: We present a new method for prediction of RNA pseudoknotted secondary structures that combines the strengths of MFE prediction and alignment-based methods. KnotAli takes a multiple RNA sequence alignment as input and uses covariation and thermodynamic energy minimization to predict possibly pseudoknotted secondary structures for each individual sequence in the alignment. We compared KnotAli’s performance to that of three other alignment-based programs, two that can handle pseudoknotted structures and one control, on a large data set of 3034 RNA sequences with varying lengths and levels of sequence conservation from 10 families with pseudoknotted and pseudoknot-free reference structures. We produced sequence alignments for each family using two well-known sequence aligners (MUSCLE and MAFFT). CONCLUSIONS: We found KnotAli’s performance to be superior in 6 of the 10 families for MUSCLE and 7 of the 10 for MAFFT. While both KnotAli and Cacofold use background noise correction strategies, we found KnotAli’s predictions to be less dependent on the alignment quality. KnotAli can be found online at the Zenodo image: 10.5281/zenodo.5794719 |
format | Online Article Text |
id | pubmed-9063079 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-90630792022-05-04 KnotAli: informed energy minimization through the use of evolutionary information Gray, Mateo Chester, Sean Jabbari, Hosna BMC Bioinformatics Research BACKGROUND: Improving the prediction of structures, especially those containing pseudoknots (structures with crossing base pairs) is an ongoing challenge. Homology-based methods utilize structural similarities within a family to predict the structure. However, their prediction is limited to the consensus structure, and by the quality of the alignment. Minimum free energy (MFE) based methods, on the other hand, do not rely on familial information and can predict structures of novel RNA molecules. Their prediction normally suffers from inaccuracies due to their underlying energy parameters. RESULTS: We present a new method for prediction of RNA pseudoknotted secondary structures that combines the strengths of MFE prediction and alignment-based methods. KnotAli takes a multiple RNA sequence alignment as input and uses covariation and thermodynamic energy minimization to predict possibly pseudoknotted secondary structures for each individual sequence in the alignment. We compared KnotAli’s performance to that of three other alignment-based programs, two that can handle pseudoknotted structures and one control, on a large data set of 3034 RNA sequences with varying lengths and levels of sequence conservation from 10 families with pseudoknotted and pseudoknot-free reference structures. We produced sequence alignments for each family using two well-known sequence aligners (MUSCLE and MAFFT). CONCLUSIONS: We found KnotAli’s performance to be superior in 6 of the 10 families for MUSCLE and 7 of the 10 for MAFFT. While both KnotAli and Cacofold use background noise correction strategies, we found KnotAli’s predictions to be less dependent on the alignment quality. KnotAli can be found online at the Zenodo image: 10.5281/zenodo.5794719 BioMed Central 2022-05-03 /pmc/articles/PMC9063079/ /pubmed/35505276 http://dx.doi.org/10.1186/s12859-022-04673-3 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Gray, Mateo Chester, Sean Jabbari, Hosna KnotAli: informed energy minimization through the use of evolutionary information |
title | KnotAli: informed energy minimization through the use of evolutionary information |
title_full | KnotAli: informed energy minimization through the use of evolutionary information |
title_fullStr | KnotAli: informed energy minimization through the use of evolutionary information |
title_full_unstemmed | KnotAli: informed energy minimization through the use of evolutionary information |
title_short | KnotAli: informed energy minimization through the use of evolutionary information |
title_sort | knotali: informed energy minimization through the use of evolutionary information |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9063079/ https://www.ncbi.nlm.nih.gov/pubmed/35505276 http://dx.doi.org/10.1186/s12859-022-04673-3 |
work_keys_str_mv | AT graymateo knotaliinformedenergyminimizationthroughtheuseofevolutionaryinformation AT chestersean knotaliinformedenergyminimizationthroughtheuseofevolutionaryinformation AT jabbarihosna knotaliinformedenergyminimizationthroughtheuseofevolutionaryinformation |