Cargando…

Predicting RNA secondary structure by the comparative approach: how to select the homologous sequences

BACKGROUND: The secondary structure of an RNA must be known before the relationship between its structure and function can be determined. One way to predict the secondary structure of an RNA is to identify covarying residues that maintain the pairings (Watson-Crick, Wobble and non-canonical pairings...

Descripción completa

Detalles Bibliográficos
Autores principales: Engelen, Stéfan, Tahi, Fariza
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2238770/
https://www.ncbi.nlm.nih.gov/pubmed/18045491
http://dx.doi.org/10.1186/1471-2105-8-464
_version_ 1782150460538355712
author Engelen, Stéfan
Tahi, Fariza
author_facet Engelen, Stéfan
Tahi, Fariza
author_sort Engelen, Stéfan
collection PubMed
description BACKGROUND: The secondary structure of an RNA must be known before the relationship between its structure and function can be determined. One way to predict the secondary structure of an RNA is to identify covarying residues that maintain the pairings (Watson-Crick, Wobble and non-canonical pairings). This "comparative approach" consists of identifying mutations from homologous sequence alignments. The sequences must covary enough for compensatory mutations to be revealed, but comparison is difficult if they are too different. Thus the choice of homologous sequences is critical. While many possible combinations of homologous sequences may be used for prediction, only a few will give good structure predictions. This can be due to poor quality alignment in stems or to the variability of certain sequences. This problem of sequence selection is currently unsolved. RESULTS: This paper describes an algorithm, SSCA, which measures the suitability of sequences for the comparative approach. It is based on evolutionary models with structure constraints, particularly those on sequence variations and stem alignment. We propose three models, based on different constraints on sequence alignments. We show the results of the SSCA algorithm for predicting the secondary structure of several RNAs. SSCA enabled us to choose sets of homologous sequences that gave better predictions than arbitrarily chosen sets of homologous sequences. CONCLUSION: SSCA is an algorithm for selecting combinations of RNA homologous sequences suitable for secondary structure predictions with the comparative approach.
format Text
id pubmed-2238770
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-22387702008-02-12 Predicting RNA secondary structure by the comparative approach: how to select the homologous sequences Engelen, Stéfan Tahi, Fariza BMC Bioinformatics Research Article BACKGROUND: The secondary structure of an RNA must be known before the relationship between its structure and function can be determined. One way to predict the secondary structure of an RNA is to identify covarying residues that maintain the pairings (Watson-Crick, Wobble and non-canonical pairings). This "comparative approach" consists of identifying mutations from homologous sequence alignments. The sequences must covary enough for compensatory mutations to be revealed, but comparison is difficult if they are too different. Thus the choice of homologous sequences is critical. While many possible combinations of homologous sequences may be used for prediction, only a few will give good structure predictions. This can be due to poor quality alignment in stems or to the variability of certain sequences. This problem of sequence selection is currently unsolved. RESULTS: This paper describes an algorithm, SSCA, which measures the suitability of sequences for the comparative approach. It is based on evolutionary models with structure constraints, particularly those on sequence variations and stem alignment. We propose three models, based on different constraints on sequence alignments. We show the results of the SSCA algorithm for predicting the secondary structure of several RNAs. SSCA enabled us to choose sets of homologous sequences that gave better predictions than arbitrarily chosen sets of homologous sequences. CONCLUSION: SSCA is an algorithm for selecting combinations of RNA homologous sequences suitable for secondary structure predictions with the comparative approach. BioMed Central 2007-11-28 /pmc/articles/PMC2238770/ /pubmed/18045491 http://dx.doi.org/10.1186/1471-2105-8-464 Text en Copyright © 2007 Engelen and Tahi; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Engelen, Stéfan
Tahi, Fariza
Predicting RNA secondary structure by the comparative approach: how to select the homologous sequences
title Predicting RNA secondary structure by the comparative approach: how to select the homologous sequences
title_full Predicting RNA secondary structure by the comparative approach: how to select the homologous sequences
title_fullStr Predicting RNA secondary structure by the comparative approach: how to select the homologous sequences
title_full_unstemmed Predicting RNA secondary structure by the comparative approach: how to select the homologous sequences
title_short Predicting RNA secondary structure by the comparative approach: how to select the homologous sequences
title_sort predicting rna secondary structure by the comparative approach: how to select the homologous sequences
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2238770/
https://www.ncbi.nlm.nih.gov/pubmed/18045491
http://dx.doi.org/10.1186/1471-2105-8-464
work_keys_str_mv AT engelenstefan predictingrnasecondarystructurebythecomparativeapproachhowtoselectthehomologoussequences
AT tahifariza predictingrnasecondarystructurebythecomparativeapproachhowtoselectthehomologoussequences