Cargando…
CircSLNN: Identifying RBP-Binding Sites on circRNAs via Sequence Labeling Neural Networks
The interactions between RNAs and RNA binding proteins (RBPs) are crucial for understanding post-transcriptional regulation mechanisms. A lot of computational tools have been developed to automatically predict the binding relationship between RNAs and RBPs. However, most of the methods can only pred...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6886371/ https://www.ncbi.nlm.nih.gov/pubmed/31824574 http://dx.doi.org/10.3389/fgene.2019.01184 |
_version_ | 1783474867391168512 |
---|---|
author | Ju, Yuqi Yuan, Liangliang Yang, Yang Zhao, Hai |
author_facet | Ju, Yuqi Yuan, Liangliang Yang, Yang Zhao, Hai |
author_sort | Ju, Yuqi |
collection | PubMed |
description | The interactions between RNAs and RNA binding proteins (RBPs) are crucial for understanding post-transcriptional regulation mechanisms. A lot of computational tools have been developed to automatically predict the binding relationship between RNAs and RBPs. However, most of the methods can only predict the presence or absence of binding sites for a sequence fragment, without providing specific information on the position or length of the binding sites. Besides, the existing tools focus on the interaction between RBPs and linear RNAs, while the binding sites on circular RNAs (circRNAs) have been rarely studied. In this study, we model the prediction of binding sites on RNAs as a sequence labeling problem, and propose a new model called circSLNN to identify the specific location of RBP-binding sites on circRNAs. CircSLNN is driven by pretrained RNA embedding vectors and a composite labeling model. On our constructed circRNA datasets, our model has an average F (1) score of 0.790. We assess the performance on full-length RNA sequences, the proposed model outperforms previous classification-based models by a large margin. |
format | Online Article Text |
id | pubmed-6886371 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-68863712019-12-10 CircSLNN: Identifying RBP-Binding Sites on circRNAs via Sequence Labeling Neural Networks Ju, Yuqi Yuan, Liangliang Yang, Yang Zhao, Hai Front Genet Genetics The interactions between RNAs and RNA binding proteins (RBPs) are crucial for understanding post-transcriptional regulation mechanisms. A lot of computational tools have been developed to automatically predict the binding relationship between RNAs and RBPs. However, most of the methods can only predict the presence or absence of binding sites for a sequence fragment, without providing specific information on the position or length of the binding sites. Besides, the existing tools focus on the interaction between RBPs and linear RNAs, while the binding sites on circular RNAs (circRNAs) have been rarely studied. In this study, we model the prediction of binding sites on RNAs as a sequence labeling problem, and propose a new model called circSLNN to identify the specific location of RBP-binding sites on circRNAs. CircSLNN is driven by pretrained RNA embedding vectors and a composite labeling model. On our constructed circRNA datasets, our model has an average F (1) score of 0.790. We assess the performance on full-length RNA sequences, the proposed model outperforms previous classification-based models by a large margin. Frontiers Media S.A. 2019-11-22 /pmc/articles/PMC6886371/ /pubmed/31824574 http://dx.doi.org/10.3389/fgene.2019.01184 Text en Copyright © 2019 Ju, Yuan, Yang and Zhao http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Ju, Yuqi Yuan, Liangliang Yang, Yang Zhao, Hai CircSLNN: Identifying RBP-Binding Sites on circRNAs via Sequence Labeling Neural Networks |
title | CircSLNN: Identifying RBP-Binding Sites on circRNAs via Sequence Labeling Neural Networks |
title_full | CircSLNN: Identifying RBP-Binding Sites on circRNAs via Sequence Labeling Neural Networks |
title_fullStr | CircSLNN: Identifying RBP-Binding Sites on circRNAs via Sequence Labeling Neural Networks |
title_full_unstemmed | CircSLNN: Identifying RBP-Binding Sites on circRNAs via Sequence Labeling Neural Networks |
title_short | CircSLNN: Identifying RBP-Binding Sites on circRNAs via Sequence Labeling Neural Networks |
title_sort | circslnn: identifying rbp-binding sites on circrnas via sequence labeling neural networks |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6886371/ https://www.ncbi.nlm.nih.gov/pubmed/31824574 http://dx.doi.org/10.3389/fgene.2019.01184 |
work_keys_str_mv | AT juyuqi circslnnidentifyingrbpbindingsitesoncircrnasviasequencelabelingneuralnetworks AT yuanliangliang circslnnidentifyingrbpbindingsitesoncircrnasviasequencelabelingneuralnetworks AT yangyang circslnnidentifyingrbpbindingsitesoncircrnasviasequencelabelingneuralnetworks AT zhaohai circslnnidentifyingrbpbindingsitesoncircrnasviasequencelabelingneuralnetworks |