Cargando…

CRIP: predicting circRNA–RBP-binding sites using a codon-based encoding and hybrid deep neural networks

Circular RNAs (circRNAs), with their crucial roles in gene regulation and disease development, have become rising stars in the RNA world. To understand the regulatory function of circRNAs, many studies focus on the interactions between circRNAs and RNA-binding proteins (RBPs). Recently, the abundant...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Kaiming, Pan, Xiaoyong, Yang, Yang, Shen, Hong-Bin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6859861/
https://www.ncbi.nlm.nih.gov/pubmed/31537716
http://dx.doi.org/10.1261/rna.070565.119
_version_ 1783471203240902656
author Zhang, Kaiming
Pan, Xiaoyong
Yang, Yang
Shen, Hong-Bin
author_facet Zhang, Kaiming
Pan, Xiaoyong
Yang, Yang
Shen, Hong-Bin
author_sort Zhang, Kaiming
collection PubMed
description Circular RNAs (circRNAs), with their crucial roles in gene regulation and disease development, have become rising stars in the RNA world. To understand the regulatory function of circRNAs, many studies focus on the interactions between circRNAs and RNA-binding proteins (RBPs). Recently, the abundant CLIP-seq experimental data has enabled the large-scale identification and analysis of circRNA–RBP interactions, whereas, as far as we know, no computational tool based on machine learning has been proposed yet. We develop CRIP (CircRNAs Interact with Proteins) for the prediction of RBP-binding sites on circRNAs using RNA sequences alone. CRIP consists of a stacked codon-based encoding scheme and a hybrid deep learning architecture, in which a convolutional neural network (CNN) learns high-level abstract features and a recurrent neural network (RNN) learns long dependency in the sequences. We construct 37 data sets including sequence fragments of binding sites on circRNAs, and each set corresponds to an RBP. The experimental results show that the new encoding scheme is superior to the existing feature representation methods for RNA sequences, and the hybrid network outperforms conventional classifiers by a large margin, where both the CNN and RNN components contribute to the performance improvement.
format Online
Article
Text
id pubmed-6859861
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-68598612020-12-01 CRIP: predicting circRNA–RBP-binding sites using a codon-based encoding and hybrid deep neural networks Zhang, Kaiming Pan, Xiaoyong Yang, Yang Shen, Hong-Bin RNA Bioinformatics Circular RNAs (circRNAs), with their crucial roles in gene regulation and disease development, have become rising stars in the RNA world. To understand the regulatory function of circRNAs, many studies focus on the interactions between circRNAs and RNA-binding proteins (RBPs). Recently, the abundant CLIP-seq experimental data has enabled the large-scale identification and analysis of circRNA–RBP interactions, whereas, as far as we know, no computational tool based on machine learning has been proposed yet. We develop CRIP (CircRNAs Interact with Proteins) for the prediction of RBP-binding sites on circRNAs using RNA sequences alone. CRIP consists of a stacked codon-based encoding scheme and a hybrid deep learning architecture, in which a convolutional neural network (CNN) learns high-level abstract features and a recurrent neural network (RNN) learns long dependency in the sequences. We construct 37 data sets including sequence fragments of binding sites on circRNAs, and each set corresponds to an RBP. The experimental results show that the new encoding scheme is superior to the existing feature representation methods for RNA sequences, and the hybrid network outperforms conventional classifiers by a large margin, where both the CNN and RNN components contribute to the performance improvement. Cold Spring Harbor Laboratory Press 2019-12 /pmc/articles/PMC6859861/ /pubmed/31537716 http://dx.doi.org/10.1261/rna.070565.119 Text en © 2019 Zhang et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society http://creativecommons.org/licenses/by-nc/4.0/ This article is distributed exclusively by the RNA Society for the first 12 months after the full-issue publication date (see http://rnajournal.cshlp.org/site/misc/terms.xhtml). After 12 months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.
spellingShingle Bioinformatics
Zhang, Kaiming
Pan, Xiaoyong
Yang, Yang
Shen, Hong-Bin
CRIP: predicting circRNA–RBP-binding sites using a codon-based encoding and hybrid deep neural networks
title CRIP: predicting circRNA–RBP-binding sites using a codon-based encoding and hybrid deep neural networks
title_full CRIP: predicting circRNA–RBP-binding sites using a codon-based encoding and hybrid deep neural networks
title_fullStr CRIP: predicting circRNA–RBP-binding sites using a codon-based encoding and hybrid deep neural networks
title_full_unstemmed CRIP: predicting circRNA–RBP-binding sites using a codon-based encoding and hybrid deep neural networks
title_short CRIP: predicting circRNA–RBP-binding sites using a codon-based encoding and hybrid deep neural networks
title_sort crip: predicting circrna–rbp-binding sites using a codon-based encoding and hybrid deep neural networks
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6859861/
https://www.ncbi.nlm.nih.gov/pubmed/31537716
http://dx.doi.org/10.1261/rna.070565.119
work_keys_str_mv AT zhangkaiming crippredictingcircrnarbpbindingsitesusingacodonbasedencodingandhybriddeepneuralnetworks
AT panxiaoyong crippredictingcircrnarbpbindingsitesusingacodonbasedencodingandhybriddeepneuralnetworks
AT yangyang crippredictingcircrnarbpbindingsitesusingacodonbasedencodingandhybriddeepneuralnetworks
AT shenhongbin crippredictingcircrnarbpbindingsitesusingacodonbasedencodingandhybriddeepneuralnetworks