Cargando…

KnotAli: informed energy minimization through the use of evolutionary information

BACKGROUND: Improving the prediction of structures, especially those containing pseudoknots (structures with crossing base pairs) is an ongoing challenge. Homology-based methods utilize structural similarities within a family to predict the structure. However, their prediction is limited to the cons...

Descripción completa

Detalles Bibliográficos
Autores principales: Gray, Mateo, Chester, Sean, Jabbari, Hosna
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9063079/
https://www.ncbi.nlm.nih.gov/pubmed/35505276
http://dx.doi.org/10.1186/s12859-022-04673-3
_version_ 1784699091202080768
author Gray, Mateo
Chester, Sean
Jabbari, Hosna
author_facet Gray, Mateo
Chester, Sean
Jabbari, Hosna
author_sort Gray, Mateo
collection PubMed
description BACKGROUND: Improving the prediction of structures, especially those containing pseudoknots (structures with crossing base pairs) is an ongoing challenge. Homology-based methods utilize structural similarities within a family to predict the structure. However, their prediction is limited to the consensus structure, and by the quality of the alignment. Minimum free energy (MFE) based methods, on the other hand, do not rely on familial information and can predict structures of novel RNA molecules. Their prediction normally suffers from inaccuracies due to their underlying energy parameters. RESULTS: We present a new method for prediction of RNA pseudoknotted secondary structures that combines the strengths of MFE prediction and alignment-based methods. KnotAli takes a multiple RNA sequence alignment as input and uses covariation and thermodynamic energy minimization to predict possibly pseudoknotted secondary structures for each individual sequence in the alignment. We compared KnotAli’s performance to that of three other alignment-based programs, two that can handle pseudoknotted structures and one control, on a large data set of 3034 RNA sequences with varying lengths and levels of sequence conservation from 10 families with pseudoknotted and pseudoknot-free reference structures. We produced sequence alignments for each family using two well-known sequence aligners (MUSCLE and MAFFT). CONCLUSIONS: We found KnotAli’s performance to be superior in 6 of the 10 families for MUSCLE and 7 of the 10 for MAFFT. While both KnotAli and Cacofold use background noise correction strategies, we found KnotAli’s predictions to be less dependent on the alignment quality. KnotAli can be found online at the Zenodo image: 10.5281/zenodo.5794719
format Online
Article
Text
id pubmed-9063079
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-90630792022-05-04 KnotAli: informed energy minimization through the use of evolutionary information Gray, Mateo Chester, Sean Jabbari, Hosna BMC Bioinformatics Research BACKGROUND: Improving the prediction of structures, especially those containing pseudoknots (structures with crossing base pairs) is an ongoing challenge. Homology-based methods utilize structural similarities within a family to predict the structure. However, their prediction is limited to the consensus structure, and by the quality of the alignment. Minimum free energy (MFE) based methods, on the other hand, do not rely on familial information and can predict structures of novel RNA molecules. Their prediction normally suffers from inaccuracies due to their underlying energy parameters. RESULTS: We present a new method for prediction of RNA pseudoknotted secondary structures that combines the strengths of MFE prediction and alignment-based methods. KnotAli takes a multiple RNA sequence alignment as input and uses covariation and thermodynamic energy minimization to predict possibly pseudoknotted secondary structures for each individual sequence in the alignment. We compared KnotAli’s performance to that of three other alignment-based programs, two that can handle pseudoknotted structures and one control, on a large data set of 3034 RNA sequences with varying lengths and levels of sequence conservation from 10 families with pseudoknotted and pseudoknot-free reference structures. We produced sequence alignments for each family using two well-known sequence aligners (MUSCLE and MAFFT). CONCLUSIONS: We found KnotAli’s performance to be superior in 6 of the 10 families for MUSCLE and 7 of the 10 for MAFFT. While both KnotAli and Cacofold use background noise correction strategies, we found KnotAli’s predictions to be less dependent on the alignment quality. KnotAli can be found online at the Zenodo image: 10.5281/zenodo.5794719 BioMed Central 2022-05-03 /pmc/articles/PMC9063079/ /pubmed/35505276 http://dx.doi.org/10.1186/s12859-022-04673-3 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Gray, Mateo
Chester, Sean
Jabbari, Hosna
KnotAli: informed energy minimization through the use of evolutionary information
title KnotAli: informed energy minimization through the use of evolutionary information
title_full KnotAli: informed energy minimization through the use of evolutionary information
title_fullStr KnotAli: informed energy minimization through the use of evolutionary information
title_full_unstemmed KnotAli: informed energy minimization through the use of evolutionary information
title_short KnotAli: informed energy minimization through the use of evolutionary information
title_sort knotali: informed energy minimization through the use of evolutionary information
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9063079/
https://www.ncbi.nlm.nih.gov/pubmed/35505276
http://dx.doi.org/10.1186/s12859-022-04673-3
work_keys_str_mv AT graymateo knotaliinformedenergyminimizationthroughtheuseofevolutionaryinformation
AT chestersean knotaliinformedenergyminimizationthroughtheuseofevolutionaryinformation
AT jabbarihosna knotaliinformedenergyminimizationthroughtheuseofevolutionaryinformation