Cargando…

RNAmountAlign: Efficient software for local, global, semiglobal pairwise and multiple RNA sequence/structure alignment

Alignment of structural RNAs is an important problem with a wide range of applications. Since function is often determined by molecular structure, RNA alignment programs should take into account both sequence and base-pairing information for structural homology identification. This paper describes C...

Descripción completa

Detalles Bibliográficos
Autores principales: Bayegan, Amir H., Clote, Peter
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6980424/
https://www.ncbi.nlm.nih.gov/pubmed/31978147
http://dx.doi.org/10.1371/journal.pone.0227177
_version_ 1783490951077953536
author Bayegan, Amir H.
Clote, Peter
author_facet Bayegan, Amir H.
Clote, Peter
author_sort Bayegan, Amir H.
collection PubMed
description Alignment of structural RNAs is an important problem with a wide range of applications. Since function is often determined by molecular structure, RNA alignment programs should take into account both sequence and base-pairing information for structural homology identification. This paper describes C++ software, RNAmountAlign, for RNA sequence/structure alignment that runs in O(n(3)) time and O(n(2)) space for two sequences of length n; moreover, our software returns a p-value (transformable to expect value E) based on Karlin-Altschul statistics for local alignment, as well as parameter fitting for local and global alignment. Using incremental mountain height, a representation of structural information computable in cubic time, RNAmountAlign implements quadratic time pairwise local, global and global/semiglobal (query search) alignment using a weighted combination of sequence and structural similarity. RNAmountAlign is capable of performing progressive multiple alignment as well. Benchmarking of RNAmountAlign against LocARNA, LARA, FOLDALIGN, DYNALIGN, STRAL, MXSCARNA, and MUSCLE shows that RNAmountAlign has reasonably good accuracy and faster run time supporting all alignment types. Additionally, our extension of RNAmountAlign, called RNAmountAlignScan, which scans a target genome sequence to find hits having high sequence and structural similarity to a given query sequence, outperforms RSEARCH and sequence-only query scans and runs faster than FOLDALIGN query scan.
format Online
Article
Text
id pubmed-6980424
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-69804242020-02-04 RNAmountAlign: Efficient software for local, global, semiglobal pairwise and multiple RNA sequence/structure alignment Bayegan, Amir H. Clote, Peter PLoS One Research Article Alignment of structural RNAs is an important problem with a wide range of applications. Since function is often determined by molecular structure, RNA alignment programs should take into account both sequence and base-pairing information for structural homology identification. This paper describes C++ software, RNAmountAlign, for RNA sequence/structure alignment that runs in O(n(3)) time and O(n(2)) space for two sequences of length n; moreover, our software returns a p-value (transformable to expect value E) based on Karlin-Altschul statistics for local alignment, as well as parameter fitting for local and global alignment. Using incremental mountain height, a representation of structural information computable in cubic time, RNAmountAlign implements quadratic time pairwise local, global and global/semiglobal (query search) alignment using a weighted combination of sequence and structural similarity. RNAmountAlign is capable of performing progressive multiple alignment as well. Benchmarking of RNAmountAlign against LocARNA, LARA, FOLDALIGN, DYNALIGN, STRAL, MXSCARNA, and MUSCLE shows that RNAmountAlign has reasonably good accuracy and faster run time supporting all alignment types. Additionally, our extension of RNAmountAlign, called RNAmountAlignScan, which scans a target genome sequence to find hits having high sequence and structural similarity to a given query sequence, outperforms RSEARCH and sequence-only query scans and runs faster than FOLDALIGN query scan. Public Library of Science 2020-01-24 /pmc/articles/PMC6980424/ /pubmed/31978147 http://dx.doi.org/10.1371/journal.pone.0227177 Text en © 2020 Bayegan, Clote http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Bayegan, Amir H.
Clote, Peter
RNAmountAlign: Efficient software for local, global, semiglobal pairwise and multiple RNA sequence/structure alignment
title RNAmountAlign: Efficient software for local, global, semiglobal pairwise and multiple RNA sequence/structure alignment
title_full RNAmountAlign: Efficient software for local, global, semiglobal pairwise and multiple RNA sequence/structure alignment
title_fullStr RNAmountAlign: Efficient software for local, global, semiglobal pairwise and multiple RNA sequence/structure alignment
title_full_unstemmed RNAmountAlign: Efficient software for local, global, semiglobal pairwise and multiple RNA sequence/structure alignment
title_short RNAmountAlign: Efficient software for local, global, semiglobal pairwise and multiple RNA sequence/structure alignment
title_sort rnamountalign: efficient software for local, global, semiglobal pairwise and multiple rna sequence/structure alignment
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6980424/
https://www.ncbi.nlm.nih.gov/pubmed/31978147
http://dx.doi.org/10.1371/journal.pone.0227177
work_keys_str_mv AT bayeganamirh rnamountalignefficientsoftwareforlocalglobalsemiglobalpairwiseandmultiplernasequencestructurealignment
AT clotepeter rnamountalignefficientsoftwareforlocalglobalsemiglobalpairwiseandmultiplernasequencestructurealignment