Cargando…

Identifying the sequence specificities of circRNA-binding proteins based on a capsule network architecture

BACKGROUND: Circular RNAs (circRNAs) are widely expressed in cells and tissues and are involved in biological processes and human diseases. Recent studies have demonstrated that circRNAs can interact with RNA-binding proteins (RBPs), which is considered an important aspect for investigating the func...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Zhengfeng, Lei, Xiujuan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7792089/
https://www.ncbi.nlm.nih.gov/pubmed/33413092
http://dx.doi.org/10.1186/s12859-020-03942-3
_version_ 1783633731339157504
author Wang, Zhengfeng
Lei, Xiujuan
author_facet Wang, Zhengfeng
Lei, Xiujuan
author_sort Wang, Zhengfeng
collection PubMed
description BACKGROUND: Circular RNAs (circRNAs) are widely expressed in cells and tissues and are involved in biological processes and human diseases. Recent studies have demonstrated that circRNAs can interact with RNA-binding proteins (RBPs), which is considered an important aspect for investigating the function of circRNAs. RESULTS: In this study, we design a slight variant of the capsule network, called circRB, to identify the sequence specificities of circRNAs binding to RBPs. In this model, the sequence features of circRNAs are extracted by convolution operations, and then, two dynamic routing algorithms in a capsule network are employed to discriminate between different binding sites by analysing the convolution features of binding sites. The experimental results show that the circRB method outperforms the existing computational methods. Afterwards, the trained models are applied to detect the sequence motifs on the seven circRNA-RBP bound sequence datasets and matched to known human RNA motifs. Some motifs on circular RNAs overlap with those on linear RNAs. Finally, we also predict binding sites on the reported full-length sequences of circRNAs interacting with RBPs, attempting to assist current studies. We hope that our model will contribute to better understanding the mechanisms of the interactions between RBPs and circRNAs. CONCLUSION: In view of the poor studies about the sequence specificities of circRNA-binding proteins, we designed a classification framework called circRB based on the capsule network. The results show that the circRB method is an effective method, and it achieves higher prediction accuracy than other methods.
format Online
Article
Text
id pubmed-7792089
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-77920892021-01-11 Identifying the sequence specificities of circRNA-binding proteins based on a capsule network architecture Wang, Zhengfeng Lei, Xiujuan BMC Bioinformatics Methodology Article BACKGROUND: Circular RNAs (circRNAs) are widely expressed in cells and tissues and are involved in biological processes and human diseases. Recent studies have demonstrated that circRNAs can interact with RNA-binding proteins (RBPs), which is considered an important aspect for investigating the function of circRNAs. RESULTS: In this study, we design a slight variant of the capsule network, called circRB, to identify the sequence specificities of circRNAs binding to RBPs. In this model, the sequence features of circRNAs are extracted by convolution operations, and then, two dynamic routing algorithms in a capsule network are employed to discriminate between different binding sites by analysing the convolution features of binding sites. The experimental results show that the circRB method outperforms the existing computational methods. Afterwards, the trained models are applied to detect the sequence motifs on the seven circRNA-RBP bound sequence datasets and matched to known human RNA motifs. Some motifs on circular RNAs overlap with those on linear RNAs. Finally, we also predict binding sites on the reported full-length sequences of circRNAs interacting with RBPs, attempting to assist current studies. We hope that our model will contribute to better understanding the mechanisms of the interactions between RBPs and circRNAs. CONCLUSION: In view of the poor studies about the sequence specificities of circRNA-binding proteins, we designed a classification framework called circRB based on the capsule network. The results show that the circRB method is an effective method, and it achieves higher prediction accuracy than other methods. BioMed Central 2021-01-07 /pmc/articles/PMC7792089/ /pubmed/33413092 http://dx.doi.org/10.1186/s12859-020-03942-3 Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology Article
Wang, Zhengfeng
Lei, Xiujuan
Identifying the sequence specificities of circRNA-binding proteins based on a capsule network architecture
title Identifying the sequence specificities of circRNA-binding proteins based on a capsule network architecture
title_full Identifying the sequence specificities of circRNA-binding proteins based on a capsule network architecture
title_fullStr Identifying the sequence specificities of circRNA-binding proteins based on a capsule network architecture
title_full_unstemmed Identifying the sequence specificities of circRNA-binding proteins based on a capsule network architecture
title_short Identifying the sequence specificities of circRNA-binding proteins based on a capsule network architecture
title_sort identifying the sequence specificities of circrna-binding proteins based on a capsule network architecture
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7792089/
https://www.ncbi.nlm.nih.gov/pubmed/33413092
http://dx.doi.org/10.1186/s12859-020-03942-3
work_keys_str_mv AT wangzhengfeng identifyingthesequencespecificitiesofcircrnabindingproteinsbasedonacapsulenetworkarchitecture
AT leixiujuan identifyingthesequencespecificitiesofcircrnabindingproteinsbasedonacapsulenetworkarchitecture