Cargando…

ortho2align: a sensitive approach for searching for orthologues of novel lncRNAs

BACKGROUND: Many novel long noncoding RNAs have been discovered in recent years due to advances in high-throughput sequencing experiments. Finding orthologues of these novel lncRNAs might facilitate clarification of their functional role in living organisms. However, lncRNAs exhibit low sequence con...

Descripción completa

Detalles Bibliográficos
Autores principales: Mylarshchikov, Dmitry Evgenevich, Mironov, Andrey Alexandrovich
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9487038/
https://www.ncbi.nlm.nih.gov/pubmed/36123626
http://dx.doi.org/10.1186/s12859-022-04929-y
_version_ 1784792408427331584
author Mylarshchikov, Dmitry Evgenevich
Mironov, Andrey Alexandrovich
author_facet Mylarshchikov, Dmitry Evgenevich
Mironov, Andrey Alexandrovich
author_sort Mylarshchikov, Dmitry Evgenevich
collection PubMed
description BACKGROUND: Many novel long noncoding RNAs have been discovered in recent years due to advances in high-throughput sequencing experiments. Finding orthologues of these novel lncRNAs might facilitate clarification of their functional role in living organisms. However, lncRNAs exhibit low sequence conservation, so specific methods for enhancing the signal-to-noise ratio were developed. Nevertheless, current methods such as transcriptomes comparison approaches or searches for conserved secondary structures are not applicable to novel, previously unannotated lncRNAs by design. RESULTS: We present ortho2align—a versatile sensitive synteny-based lncRNA orthologue search tool with statistical assessment of sequence conservation. This tool allows control of the specificity of the search process and optional annotation of found orthologues. ortho2align shows similar performance in terms of sensitivity and resource usage as the state-of-the-art method for aligning orthologous lncRNAs but also enables scientists to predict unannotated orthologous sequences for lncRNAs in question. Using ortho2align, we predicted orthologues of three distinct classes of novel human lncRNAs in six Vertebrata species to estimate their degree of conservation. CONCLUSIONS: Being designed for the discovery of unannotated orthologues of novel lncRNAs in distant species, ortho2align is a versatile tool applicable to any genomic regions, especially weakly conserved ones. A small amount of input files makes ortho2align easy to use in orthology studies as a single tool or in bundle with other steps that researchers will consider sensible. ortho2align is available as an Anaconda package with its source code hosted at https://github.com/dmitrymyl/ortho2align. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-04929-y.
format Online
Article
Text
id pubmed-9487038
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-94870382022-09-21 ortho2align: a sensitive approach for searching for orthologues of novel lncRNAs Mylarshchikov, Dmitry Evgenevich Mironov, Andrey Alexandrovich BMC Bioinformatics Software BACKGROUND: Many novel long noncoding RNAs have been discovered in recent years due to advances in high-throughput sequencing experiments. Finding orthologues of these novel lncRNAs might facilitate clarification of their functional role in living organisms. However, lncRNAs exhibit low sequence conservation, so specific methods for enhancing the signal-to-noise ratio were developed. Nevertheless, current methods such as transcriptomes comparison approaches or searches for conserved secondary structures are not applicable to novel, previously unannotated lncRNAs by design. RESULTS: We present ortho2align—a versatile sensitive synteny-based lncRNA orthologue search tool with statistical assessment of sequence conservation. This tool allows control of the specificity of the search process and optional annotation of found orthologues. ortho2align shows similar performance in terms of sensitivity and resource usage as the state-of-the-art method for aligning orthologous lncRNAs but also enables scientists to predict unannotated orthologous sequences for lncRNAs in question. Using ortho2align, we predicted orthologues of three distinct classes of novel human lncRNAs in six Vertebrata species to estimate their degree of conservation. CONCLUSIONS: Being designed for the discovery of unannotated orthologues of novel lncRNAs in distant species, ortho2align is a versatile tool applicable to any genomic regions, especially weakly conserved ones. A small amount of input files makes ortho2align easy to use in orthology studies as a single tool or in bundle with other steps that researchers will consider sensible. ortho2align is available as an Anaconda package with its source code hosted at https://github.com/dmitrymyl/ortho2align. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-04929-y. BioMed Central 2022-09-19 /pmc/articles/PMC9487038/ /pubmed/36123626 http://dx.doi.org/10.1186/s12859-022-04929-y Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Software
Mylarshchikov, Dmitry Evgenevich
Mironov, Andrey Alexandrovich
ortho2align: a sensitive approach for searching for orthologues of novel lncRNAs
title ortho2align: a sensitive approach for searching for orthologues of novel lncRNAs
title_full ortho2align: a sensitive approach for searching for orthologues of novel lncRNAs
title_fullStr ortho2align: a sensitive approach for searching for orthologues of novel lncRNAs
title_full_unstemmed ortho2align: a sensitive approach for searching for orthologues of novel lncRNAs
title_short ortho2align: a sensitive approach for searching for orthologues of novel lncRNAs
title_sort ortho2align: a sensitive approach for searching for orthologues of novel lncrnas
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9487038/
https://www.ncbi.nlm.nih.gov/pubmed/36123626
http://dx.doi.org/10.1186/s12859-022-04929-y
work_keys_str_mv AT mylarshchikovdmitryevgenevich ortho2alignasensitiveapproachforsearchingfororthologuesofnovellncrnas
AT mironovandreyalexandrovich ortho2alignasensitiveapproachforsearchingfororthologuesofnovellncrnas