Cargando…
Automatic generation of pseudoknotted RNAs taxonomy
BACKGROUND: The ability to compare RNA secondary structures is important in understanding their biological function and for grouping similar organisms into families by looking at evolutionarily conserved sequences such as 16S rRNA. Most comparison methods and benchmarks in the literature focus on ps...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10268371/ https://www.ncbi.nlm.nih.gov/pubmed/37322429 http://dx.doi.org/10.1186/s12859-023-05362-5 |
_version_ | 1785059078309937152 |
---|---|
author | Quadrini, Michela Tesei, Luca Merelli, Emanuela |
author_facet | Quadrini, Michela Tesei, Luca Merelli, Emanuela |
author_sort | Quadrini, Michela |
collection | PubMed |
description | BACKGROUND: The ability to compare RNA secondary structures is important in understanding their biological function and for grouping similar organisms into families by looking at evolutionarily conserved sequences such as 16S rRNA. Most comparison methods and benchmarks in the literature focus on pseudoknot-free structures due to the difficulty of mapping pseudoknots in classical tree representations. Some approaches exist that permit to cluster pseudoknotted RNAs but there is not a general framework for evaluating their performance. RESULTS: We introduce an evaluation framework based on a similarity/dissimilarity measure obtained by a comparison method and agglomerative clustering. Their combination automatically partition a set of molecules into groups. To illustrate the framework we define and make available a benchmark of pseudoknotted (16S and 23S) and pseudoknot-free (5S) rRNA secondary structures belonging to Archaea, Bacteria and Eukaryota. We also consider five different comparison methods from the literature that are able to manage pseudoknots. For each method we clusterize the molecules in the benchmark to obtain the taxa at the rank phylum according to the European Nucleotide Archive curated taxonomy. We compute appropriate metrics for each method and we compare their suitability to reconstruct the taxa. |
format | Online Article Text |
id | pubmed-10268371 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-102683712023-06-15 Automatic generation of pseudoknotted RNAs taxonomy Quadrini, Michela Tesei, Luca Merelli, Emanuela BMC Bioinformatics Research BACKGROUND: The ability to compare RNA secondary structures is important in understanding their biological function and for grouping similar organisms into families by looking at evolutionarily conserved sequences such as 16S rRNA. Most comparison methods and benchmarks in the literature focus on pseudoknot-free structures due to the difficulty of mapping pseudoknots in classical tree representations. Some approaches exist that permit to cluster pseudoknotted RNAs but there is not a general framework for evaluating their performance. RESULTS: We introduce an evaluation framework based on a similarity/dissimilarity measure obtained by a comparison method and agglomerative clustering. Their combination automatically partition a set of molecules into groups. To illustrate the framework we define and make available a benchmark of pseudoknotted (16S and 23S) and pseudoknot-free (5S) rRNA secondary structures belonging to Archaea, Bacteria and Eukaryota. We also consider five different comparison methods from the literature that are able to manage pseudoknots. For each method we clusterize the molecules in the benchmark to obtain the taxa at the rank phylum according to the European Nucleotide Archive curated taxonomy. We compute appropriate metrics for each method and we compare their suitability to reconstruct the taxa. BioMed Central 2023-06-15 /pmc/articles/PMC10268371/ /pubmed/37322429 http://dx.doi.org/10.1186/s12859-023-05362-5 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Quadrini, Michela Tesei, Luca Merelli, Emanuela Automatic generation of pseudoknotted RNAs taxonomy |
title | Automatic generation of pseudoknotted RNAs taxonomy |
title_full | Automatic generation of pseudoknotted RNAs taxonomy |
title_fullStr | Automatic generation of pseudoknotted RNAs taxonomy |
title_full_unstemmed | Automatic generation of pseudoknotted RNAs taxonomy |
title_short | Automatic generation of pseudoknotted RNAs taxonomy |
title_sort | automatic generation of pseudoknotted rnas taxonomy |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10268371/ https://www.ncbi.nlm.nih.gov/pubmed/37322429 http://dx.doi.org/10.1186/s12859-023-05362-5 |
work_keys_str_mv | AT quadrinimichela automaticgenerationofpseudoknottedrnastaxonomy AT teseiluca automaticgenerationofpseudoknottedrnastaxonomy AT merelliemanuela automaticgenerationofpseudoknottedrnastaxonomy |