
Data of protein-RNA binding sites


Bibliographic Details
Main Authors: Lee, Wook; Park, Byungkyu; Choi, Daesik; Han, Kyungsook
Format: Online Article Text
Language: English
Published: Elsevier 2016
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5219607/
https://www.ncbi.nlm.nih.gov/pubmed/28070546
http://dx.doi.org/10.1016/j.dib.2016.12.041
author Lee, Wook
Park, Byungkyu
Choi, Daesik
Han, Kyungsook
collection PubMed
description Despite the increasing number of protein-RNA complexes in structure databases, few data resources are available that can be readily used to develop or test methods for predicting either protein-binding sites in RNA sequences or RNA-binding sites in protein sequences. Predicting protein-binding sites in RNA has received much less attention than predicting RNA-binding sites in proteins. The data presented in this paper are related to the article entitled “PRIdictor: Protein-RNA Interaction predictor” (Tuvshinjargal et al. 2016) [1]. PRIdictor can predict protein-binding sites in RNA at the nucleotide level and RNA-binding sites in proteins at the residue level. This paper presents four datasets that were used to test the four prediction models of PRIdictor: (1) model RP, which predicts protein-binding sites in RNA from protein and RNA sequences; (2) model RaP, which predicts protein-binding sites in RNA from the RNA sequence alone; (3) model PR, which predicts RNA-binding sites in proteins from protein and RNA sequences; and (4) model PaR, which predicts RNA-binding sites in proteins from the protein sequence alone. The datasets supplied in this article can be used to evaluate and compare different methods for predicting protein-RNA binding sites.
format Online
Article
Text
id pubmed-5219607
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-5219607 2017-01-09 Data of protein-RNA binding sites Lee, Wook Park, Byungkyu Choi, Daesik Han, Kyungsook Data Brief Data Article Elsevier 2016-12-29 /pmc/articles/PMC5219607/ /pubmed/28070546 http://dx.doi.org/10.1016/j.dib.2016.12.041 Text en © 2017 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Data Article
Lee, Wook
Park, Byungkyu
Choi, Daesik
Han, Kyungsook
Data of protein-RNA binding sites
title Data of protein-RNA binding sites
topic Data Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5219607/
https://www.ncbi.nlm.nih.gov/pubmed/28070546
http://dx.doi.org/10.1016/j.dib.2016.12.041