RNAlign2D: a rapid method for combined RNA structure and sequence-based alignment using a pseudo-amino acid substitution matrix

BACKGROUND: The functions of RNA molecules are mainly determined by their secondary structures. These functions can also be predicted using bioinformatic tools that enable the alignment of multiple RNAs to determine functional domains and/or classify RNA molecules into RNA families. However, the exi...

Bibliographic Details
Main authors: Woźniak, Tomasz; Sajek, Małgorzata; Jaruzelska, Jadwiga; Sajek, Marcin Piotr
Format: Online Article Text
Language: English
Published: BioMed Central 2021
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8520625/
https://www.ncbi.nlm.nih.gov/pubmed/34656080
http://dx.doi.org/10.1186/s12859-021-04426-8
author Woźniak, Tomasz
Sajek, Małgorzata
Jaruzelska, Jadwiga
Sajek, Marcin Piotr
collection PubMed
description BACKGROUND: The functions of RNA molecules are mainly determined by their secondary structures. These functions can also be predicted using bioinformatic tools that enable the alignment of multiple RNAs to determine functional domains and/or classify RNA molecules into RNA families. However, the existing multiple RNA alignment tools, which use structural information, are slow in aligning long molecules and/or a large number of molecules. Therefore, a more rapid tool for multiple RNA alignment may improve the classification of known RNAs and help to reveal the functions of newly discovered RNAs. RESULTS: Here, we introduce an extremely fast Python-based tool called RNAlign2D. It converts RNA sequences to pseudo-amino acid sequences, which incorporate structural information, and uses a customizable scoring matrix to align these RNA molecules via the multiple protein sequence alignment tool MUSCLE. CONCLUSIONS: RNAlign2D produces accurate RNA alignments in a very short time. The pseudo-amino acid substitution matrix approach utilized in RNAlign2D is applicable for virtually all protein aligners. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04426-8.
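The conversion step named in the abstract (RNA sequence plus secondary structure rewritten as a pseudo-amino acid sequence) can be illustrated with a minimal sketch. The letter assignments, the dot-bracket input, and the function names `to_pseudo_protein` and `from_pseudo_protein` are assumptions made for illustration only; this record does not give RNAlign2D's actual mapping table or scoring matrix, and the sketch does not call MUSCLE.

```python
from itertools import product

NUCLEOTIDES = "ACGU"
STRUCTURE = "(.)"            # dot-bracket symbols: opening pair, unpaired, closing pair
PSEUDO_AA = "ACDEFGHIKLMN"   # hypothetical choice of 12 amino acid letters

# One amino acid letter per (nucleotide, structure symbol) combination.
# This table is an illustrative assumption, not the mapping used by RNAlign2D.
ENCODE = dict(zip(product(NUCLEOTIDES, STRUCTURE), PSEUDO_AA))
DECODE = {aa: pair for pair, aa in ENCODE.items()}


def to_pseudo_protein(sequence, structure):
    """Encode an RNA sequence and its dot-bracket structure as one string."""
    if len(sequence) != len(structure):
        raise ValueError("sequence and structure must have the same length")
    return "".join(ENCODE[(n, s)] for n, s in zip(sequence.upper(), structure))


def from_pseudo_protein(pseudo):
    """Decode a (possibly gapped) pseudo-protein back to sequence and structure."""
    seq, struct = [], []
    for aa in pseudo:
        # Alignment gaps are carried through to both output strings.
        n, s = ("-", "-") if aa == "-" else DECODE[aa]
        seq.append(n)
        struct.append(s)
    return "".join(seq), "".join(struct)


encoded = to_pseudo_protein("GGGAAACCC", "(((...)))")
decoded = from_pseudo_protein(encoded)
```

Because each letter carries both the base and its pairing state, a protein aligner given such strings and a suitable substitution matrix scores sequence and structure agreement in a single pass, which is the idea the abstract describes.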
format Online
Article
Text
id pubmed-8520625
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-8520625 2021-10-20 RNAlign2D: a rapid method for combined RNA structure and sequence-based alignment using a pseudo-amino acid substitution matrix Woźniak, Tomasz; Sajek, Małgorzata; Jaruzelska, Jadwiga; Sajek, Marcin Piotr BMC Bioinformatics Software
BioMed Central 2021-10-16 /pmc/articles/PMC8520625/ /pubmed/34656080 http://dx.doi.org/10.1186/s12859-021-04426-8 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/ Open Access: This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
title RNAlign2D: a rapid method for combined RNA structure and sequence-based alignment using a pseudo-amino acid substitution matrix
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8520625/
https://www.ncbi.nlm.nih.gov/pubmed/34656080
http://dx.doi.org/10.1186/s12859-021-04426-8