Cargando…

RNAsolo: a repository of cleaned PDB-derived RNA 3D structures

MOTIVATION: The development of algorithms dedicated to RNA three-dimensional (3D) structures contributes to the demand for training, testing and benchmarking data. A reliable source of such data derived from computational prediction is the RNA-Puzzles repository. In contrast, the largest resource wi...

Descripción completa

Detalles Bibliográficos
Autores principales: Adamczyk, Bartosz, Antczak, Maciej, Szachniuk, Marta
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9272803/
https://www.ncbi.nlm.nih.gov/pubmed/35674373
http://dx.doi.org/10.1093/bioinformatics/btac386
Descripción
Sumario:MOTIVATION: The development of algorithms dedicated to RNA three-dimensional (3D) structures contributes to the demand for training, testing and benchmarking data. A reliable source of such data derived from computational prediction is the RNA-Puzzles repository. In contrast, the largest resource with experimentally determined structures is the Protein Data Bank. However, files in this archive often contain other molecular data in addition to the RNA structure itself, which—to be used by RNA processing algorithms—should be removed. RESULTS: RNAsolo is a self-updating database dedicated to RNA bioinformatics. It systematically collects experimentally determined RNA 3D structures stored in the PDB, cleans them from non-RNA chains, and groups them into equivalence classes. It allows users to download various subsets of data—clustered by resolution, source, data format, etc.—for further processing and analysis with a single click. AVAILABILITY AND IMPLEMENTATION: The repository is publicly available at https://rnasolo.cs.put.poznan.pl.