Cargando…

CircSLNN: Identifying RBP-Binding Sites on circRNAs via Sequence Labeling Neural Networks

The interactions between RNAs and RNA binding proteins (RBPs) are crucial for understanding post-transcriptional regulation mechanisms. A lot of computational tools have been developed to automatically predict the binding relationship between RNAs and RBPs. However, most of the methods can only pred...

Descripción completa

Detalles Bibliográficos
Autores principales: Ju, Yuqi, Yuan, Liangliang, Yang, Yang, Zhao, Hai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6886371/
https://www.ncbi.nlm.nih.gov/pubmed/31824574
http://dx.doi.org/10.3389/fgene.2019.01184
_version_ 1783474867391168512
author Ju, Yuqi
Yuan, Liangliang
Yang, Yang
Zhao, Hai
author_facet Ju, Yuqi
Yuan, Liangliang
Yang, Yang
Zhao, Hai
author_sort Ju, Yuqi
collection PubMed
description The interactions between RNAs and RNA binding proteins (RBPs) are crucial for understanding post-transcriptional regulation mechanisms. A lot of computational tools have been developed to automatically predict the binding relationship between RNAs and RBPs. However, most of the methods can only predict the presence or absence of binding sites for a sequence fragment, without providing specific information on the position or length of the binding sites. Besides, the existing tools focus on the interaction between RBPs and linear RNAs, while the binding sites on circular RNAs (circRNAs) have been rarely studied. In this study, we model the prediction of binding sites on RNAs as a sequence labeling problem, and propose a new model called circSLNN to identify the specific location of RBP-binding sites on circRNAs. CircSLNN is driven by pretrained RNA embedding vectors and a composite labeling model. On our constructed circRNA datasets, our model has an average F (1) score of 0.790. We assess the performance on full-length RNA sequences, the proposed model outperforms previous classification-based models by a large margin.
format Online
Article
Text
id pubmed-6886371
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-68863712019-12-10 CircSLNN: Identifying RBP-Binding Sites on circRNAs via Sequence Labeling Neural Networks Ju, Yuqi Yuan, Liangliang Yang, Yang Zhao, Hai Front Genet Genetics The interactions between RNAs and RNA binding proteins (RBPs) are crucial for understanding post-transcriptional regulation mechanisms. A lot of computational tools have been developed to automatically predict the binding relationship between RNAs and RBPs. However, most of the methods can only predict the presence or absence of binding sites for a sequence fragment, without providing specific information on the position or length of the binding sites. Besides, the existing tools focus on the interaction between RBPs and linear RNAs, while the binding sites on circular RNAs (circRNAs) have been rarely studied. In this study, we model the prediction of binding sites on RNAs as a sequence labeling problem, and propose a new model called circSLNN to identify the specific location of RBP-binding sites on circRNAs. CircSLNN is driven by pretrained RNA embedding vectors and a composite labeling model. On our constructed circRNA datasets, our model has an average F (1) score of 0.790. We assess the performance on full-length RNA sequences, the proposed model outperforms previous classification-based models by a large margin. Frontiers Media S.A. 2019-11-22 /pmc/articles/PMC6886371/ /pubmed/31824574 http://dx.doi.org/10.3389/fgene.2019.01184 Text en Copyright © 2019 Ju, Yuan, Yang and Zhao http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Ju, Yuqi
Yuan, Liangliang
Yang, Yang
Zhao, Hai
CircSLNN: Identifying RBP-Binding Sites on circRNAs via Sequence Labeling Neural Networks
title CircSLNN: Identifying RBP-Binding Sites on circRNAs via Sequence Labeling Neural Networks
title_full CircSLNN: Identifying RBP-Binding Sites on circRNAs via Sequence Labeling Neural Networks
title_fullStr CircSLNN: Identifying RBP-Binding Sites on circRNAs via Sequence Labeling Neural Networks
title_full_unstemmed CircSLNN: Identifying RBP-Binding Sites on circRNAs via Sequence Labeling Neural Networks
title_short CircSLNN: Identifying RBP-Binding Sites on circRNAs via Sequence Labeling Neural Networks
title_sort circslnn: identifying rbp-binding sites on circrnas via sequence labeling neural networks
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6886371/
https://www.ncbi.nlm.nih.gov/pubmed/31824574
http://dx.doi.org/10.3389/fgene.2019.01184
work_keys_str_mv AT juyuqi circslnnidentifyingrbpbindingsitesoncircrnasviasequencelabelingneuralnetworks
AT yuanliangliang circslnnidentifyingrbpbindingsitesoncircrnasviasequencelabelingneuralnetworks
AT yangyang circslnnidentifyingrbpbindingsitesoncircrnasviasequencelabelingneuralnetworks
AT zhaohai circslnnidentifyingrbpbindingsitesoncircrnasviasequencelabelingneuralnetworks