Cargando…

RNAsolo: a repository of cleaned PDB-derived RNA 3D structures

MOTIVATION: The development of algorithms dedicated to RNA three-dimensional (3D) structures contributes to the demand for training, testing and benchmarking data. A reliable source of such data derived from computational prediction is the RNA-Puzzles repository. In contrast, the largest resource wi...

Descripción completa

Detalles Bibliográficos
Autores principales: Adamczyk, Bartosz, Antczak, Maciej, Szachniuk, Marta
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9272803/
https://www.ncbi.nlm.nih.gov/pubmed/35674373
http://dx.doi.org/10.1093/bioinformatics/btac386
_version_ 1784744947812925440
author Adamczyk, Bartosz
Antczak, Maciej
Szachniuk, Marta
author_facet Adamczyk, Bartosz
Antczak, Maciej
Szachniuk, Marta
author_sort Adamczyk, Bartosz
collection PubMed
description MOTIVATION: The development of algorithms dedicated to RNA three-dimensional (3D) structures contributes to the demand for training, testing and benchmarking data. A reliable source of such data derived from computational prediction is the RNA-Puzzles repository. In contrast, the largest resource with experimentally determined structures is the Protein Data Bank. However, files in this archive often contain other molecular data in addition to the RNA structure itself, which—to be used by RNA processing algorithms—should be removed. RESULTS: RNAsolo is a self-updating database dedicated to RNA bioinformatics. It systematically collects experimentally determined RNA 3D structures stored in the PDB, cleans them from non-RNA chains, and groups them into equivalence classes. It allows users to download various subsets of data—clustered by resolution, source, data format, etc.—for further processing and analysis with a single click. AVAILABILITY AND IMPLEMENTATION: The repository is publicly available at https://rnasolo.cs.put.poznan.pl.
format Online
Article
Text
id pubmed-9272803
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-92728032022-07-11 RNAsolo: a repository of cleaned PDB-derived RNA 3D structures Adamczyk, Bartosz Antczak, Maciej Szachniuk, Marta Bioinformatics Applications Notes MOTIVATION: The development of algorithms dedicated to RNA three-dimensional (3D) structures contributes to the demand for training, testing and benchmarking data. A reliable source of such data derived from computational prediction is the RNA-Puzzles repository. In contrast, the largest resource with experimentally determined structures is the Protein Data Bank. However, files in this archive often contain other molecular data in addition to the RNA structure itself, which—to be used by RNA processing algorithms—should be removed. RESULTS: RNAsolo is a self-updating database dedicated to RNA bioinformatics. It systematically collects experimentally determined RNA 3D structures stored in the PDB, cleans them from non-RNA chains, and groups them into equivalence classes. It allows users to download various subsets of data—clustered by resolution, source, data format, etc.—for further processing and analysis with a single click. AVAILABILITY AND IMPLEMENTATION: The repository is publicly available at https://rnasolo.cs.put.poznan.pl. Oxford University Press 2022-06-08 /pmc/articles/PMC9272803/ /pubmed/35674373 http://dx.doi.org/10.1093/bioinformatics/btac386 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Applications Notes
Adamczyk, Bartosz
Antczak, Maciej
Szachniuk, Marta
RNAsolo: a repository of cleaned PDB-derived RNA 3D structures
title RNAsolo: a repository of cleaned PDB-derived RNA 3D structures
title_full RNAsolo: a repository of cleaned PDB-derived RNA 3D structures
title_fullStr RNAsolo: a repository of cleaned PDB-derived RNA 3D structures
title_full_unstemmed RNAsolo: a repository of cleaned PDB-derived RNA 3D structures
title_short RNAsolo: a repository of cleaned PDB-derived RNA 3D structures
title_sort rnasolo: a repository of cleaned pdb-derived rna 3d structures
topic Applications Notes
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9272803/
https://www.ncbi.nlm.nih.gov/pubmed/35674373
http://dx.doi.org/10.1093/bioinformatics/btac386
work_keys_str_mv AT adamczykbartosz rnasoloarepositoryofcleanedpdbderivedrna3dstructures
AT antczakmaciej rnasoloarepositoryofcleanedpdbderivedrna3dstructures
AT szachniukmarta rnasoloarepositoryofcleanedpdbderivedrna3dstructures