Cargando…
Identifying the sequence specificities of circRNA-binding proteins based on a capsule network architecture
BACKGROUND: Circular RNAs (circRNAs) are widely expressed in cells and tissues and are involved in biological processes and human diseases. Recent studies have demonstrated that circRNAs can interact with RNA-binding proteins (RBPs), which is considered an important aspect for investigating the func...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7792089/ https://www.ncbi.nlm.nih.gov/pubmed/33413092 http://dx.doi.org/10.1186/s12859-020-03942-3 |
_version_ | 1783633731339157504 |
---|---|
author | Wang, Zhengfeng Lei, Xiujuan |
author_facet | Wang, Zhengfeng Lei, Xiujuan |
author_sort | Wang, Zhengfeng |
collection | PubMed |
description | BACKGROUND: Circular RNAs (circRNAs) are widely expressed in cells and tissues and are involved in biological processes and human diseases. Recent studies have demonstrated that circRNAs can interact with RNA-binding proteins (RBPs), which is considered an important aspect for investigating the function of circRNAs. RESULTS: In this study, we design a slight variant of the capsule network, called circRB, to identify the sequence specificities of circRNAs binding to RBPs. In this model, the sequence features of circRNAs are extracted by convolution operations, and then, two dynamic routing algorithms in a capsule network are employed to discriminate between different binding sites by analysing the convolution features of binding sites. The experimental results show that the circRB method outperforms the existing computational methods. Afterwards, the trained models are applied to detect the sequence motifs on the seven circRNA-RBP bound sequence datasets and matched to known human RNA motifs. Some motifs on circular RNAs overlap with those on linear RNAs. Finally, we also predict binding sites on the reported full-length sequences of circRNAs interacting with RBPs, attempting to assist current studies. We hope that our model will contribute to better understanding the mechanisms of the interactions between RBPs and circRNAs. CONCLUSION: In view of the poor studies about the sequence specificities of circRNA-binding proteins, we designed a classification framework called circRB based on the capsule network. The results show that the circRB method is an effective method, and it achieves higher prediction accuracy than other methods. |
format | Online Article Text |
id | pubmed-7792089 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-77920892021-01-11 Identifying the sequence specificities of circRNA-binding proteins based on a capsule network architecture Wang, Zhengfeng Lei, Xiujuan BMC Bioinformatics Methodology Article BACKGROUND: Circular RNAs (circRNAs) are widely expressed in cells and tissues and are involved in biological processes and human diseases. Recent studies have demonstrated that circRNAs can interact with RNA-binding proteins (RBPs), which is considered an important aspect for investigating the function of circRNAs. RESULTS: In this study, we design a slight variant of the capsule network, called circRB, to identify the sequence specificities of circRNAs binding to RBPs. In this model, the sequence features of circRNAs are extracted by convolution operations, and then, two dynamic routing algorithms in a capsule network are employed to discriminate between different binding sites by analysing the convolution features of binding sites. The experimental results show that the circRB method outperforms the existing computational methods. Afterwards, the trained models are applied to detect the sequence motifs on the seven circRNA-RBP bound sequence datasets and matched to known human RNA motifs. Some motifs on circular RNAs overlap with those on linear RNAs. Finally, we also predict binding sites on the reported full-length sequences of circRNAs interacting with RBPs, attempting to assist current studies. We hope that our model will contribute to better understanding the mechanisms of the interactions between RBPs and circRNAs. CONCLUSION: In view of the poor studies about the sequence specificities of circRNA-binding proteins, we designed a classification framework called circRB based on the capsule network. The results show that the circRB method is an effective method, and it achieves higher prediction accuracy than other methods. BioMed Central 2021-01-07 /pmc/articles/PMC7792089/ /pubmed/33413092 http://dx.doi.org/10.1186/s12859-020-03942-3 Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Methodology Article Wang, Zhengfeng Lei, Xiujuan Identifying the sequence specificities of circRNA-binding proteins based on a capsule network architecture |
title | Identifying the sequence specificities of circRNA-binding proteins based on a capsule network architecture |
title_full | Identifying the sequence specificities of circRNA-binding proteins based on a capsule network architecture |
title_fullStr | Identifying the sequence specificities of circRNA-binding proteins based on a capsule network architecture |
title_full_unstemmed | Identifying the sequence specificities of circRNA-binding proteins based on a capsule network architecture |
title_short | Identifying the sequence specificities of circRNA-binding proteins based on a capsule network architecture |
title_sort | identifying the sequence specificities of circrna-binding proteins based on a capsule network architecture |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7792089/ https://www.ncbi.nlm.nih.gov/pubmed/33413092 http://dx.doi.org/10.1186/s12859-020-03942-3 |
work_keys_str_mv | AT wangzhengfeng identifyingthesequencespecificitiesofcircrnabindingproteinsbasedonacapsulenetworkarchitecture AT leixiujuan identifyingthesequencespecificitiesofcircrnabindingproteinsbasedonacapsulenetworkarchitecture |