Cargando…

Lightweight comparison of RNAs based on exact sequence–structure matches

Motivation: Specific functions of ribonucleic acid (RNA) molecules are often associated with different motifs in the RNA structure. The key feature that forms such an RNA motif is the combination of sequence and structure properties. In this article, we introduce a new RNA sequence–structure compari...

Descripción completa

Detalles Bibliográficos
Autores principales: Heyne, Steffen, Will, Sebastian, Beckstette, Michael, Backofen, Rolf
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2722993/
https://www.ncbi.nlm.nih.gov/pubmed/19189979
http://dx.doi.org/10.1093/bioinformatics/btp065
_version_ 1782170344700772352
author Heyne, Steffen
Will, Sebastian
Beckstette, Michael
Backofen, Rolf
author_facet Heyne, Steffen
Will, Sebastian
Beckstette, Michael
Backofen, Rolf
author_sort Heyne, Steffen
collection PubMed
description Motivation: Specific functions of ribonucleic acid (RNA) molecules are often associated with different motifs in the RNA structure. The key feature that forms such an RNA motif is the combination of sequence and structure properties. In this article, we introduce a new RNA sequence–structure comparison method which maintains exact matching substructures. Existing common substructures are treated as whole unit while variability is allowed between such structural motifs. Based on a fast detectable set of overlapping and crossing substructure matches for two nested RNA secondary structures, our method ExpaRNA (exact pattern of alignment of RNA) computes the longest collinear sequence of substructures common to two RNAs in O(H·nm) time and O(nm) space, where H ≪ n·m for real RNA structures. Applied to different RNAs, our method correctly identifies sequence–structure similarities between two RNAs. Results: We have compared ExpaRNA with two other alignment methods that work with given RNA structures, namely RNAforester and RNA_align. The results are in good agreement, but can be obtained in a fraction of running time, in particular for larger RNAs. We have also used ExpaRNA to speed up state-of-the-art Sankoff-style alignment tools like LocARNA, and observe a tradeoff between quality and speed. However, we get a speedup of 4.25 even in the highest quality setting, where the quality of the produced alignment is comparable to that of LocARNA alone. Availability: The presented algorithm is implemented in the program ExpaRNA, which is available from our website (http://www.bioinf.uni-freiburg.de/Software). Contact: {exparna@informatik.uni-freiburg.de,backofen@informatik.uni-freiburg.de} Supplementary information: Supplementary data are available at Bioinformatics online.
format Text
id pubmed-2722993
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-27229932009-08-07 Lightweight comparison of RNAs based on exact sequence–structure matches Heyne, Steffen Will, Sebastian Beckstette, Michael Backofen, Rolf Bioinformatics German Conference on Bioinformatics Motivation: Specific functions of ribonucleic acid (RNA) molecules are often associated with different motifs in the RNA structure. The key feature that forms such an RNA motif is the combination of sequence and structure properties. In this article, we introduce a new RNA sequence–structure comparison method which maintains exact matching substructures. Existing common substructures are treated as whole unit while variability is allowed between such structural motifs. Based on a fast detectable set of overlapping and crossing substructure matches for two nested RNA secondary structures, our method ExpaRNA (exact pattern of alignment of RNA) computes the longest collinear sequence of substructures common to two RNAs in O(H·nm) time and O(nm) space, where H ≪ n·m for real RNA structures. Applied to different RNAs, our method correctly identifies sequence–structure similarities between two RNAs. Results: We have compared ExpaRNA with two other alignment methods that work with given RNA structures, namely RNAforester and RNA_align. The results are in good agreement, but can be obtained in a fraction of running time, in particular for larger RNAs. We have also used ExpaRNA to speed up state-of-the-art Sankoff-style alignment tools like LocARNA, and observe a tradeoff between quality and speed. However, we get a speedup of 4.25 even in the highest quality setting, where the quality of the produced alignment is comparable to that of LocARNA alone. Availability: The presented algorithm is implemented in the program ExpaRNA, which is available from our website (http://www.bioinf.uni-freiburg.de/Software). Contact: {exparna@informatik.uni-freiburg.de,backofen@informatik.uni-freiburg.de} Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2009-08-15 2009-02-02 /pmc/articles/PMC2722993/ /pubmed/19189979 http://dx.doi.org/10.1093/bioinformatics/btp065 Text en http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle German Conference on Bioinformatics
Heyne, Steffen
Will, Sebastian
Beckstette, Michael
Backofen, Rolf
Lightweight comparison of RNAs based on exact sequence–structure matches
title Lightweight comparison of RNAs based on exact sequence–structure matches
title_full Lightweight comparison of RNAs based on exact sequence–structure matches
title_fullStr Lightweight comparison of RNAs based on exact sequence–structure matches
title_full_unstemmed Lightweight comparison of RNAs based on exact sequence–structure matches
title_short Lightweight comparison of RNAs based on exact sequence–structure matches
title_sort lightweight comparison of rnas based on exact sequence–structure matches
topic German Conference on Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2722993/
https://www.ncbi.nlm.nih.gov/pubmed/19189979
http://dx.doi.org/10.1093/bioinformatics/btp065
work_keys_str_mv AT heynesteffen lightweightcomparisonofrnasbasedonexactsequencestructurematches
AT willsebastian lightweightcomparisonofrnasbasedonexactsequencestructurematches
AT beckstettemichael lightweightcomparisonofrnasbasedonexactsequencestructurematches
AT backofenrolf lightweightcomparisonofrnasbasedonexactsequencestructurematches