Cargando…

Prediction of Back-splicing sites for CircRNA formation based on convolutional neural networks

BACKGROUND: Circular RNAs (CircRNAs) play critical roles in gene expression regulation and disease development. Understanding the regulation mechanism of CircRNAs formation can help reveal the role of CircRNAs in various biological processes mentioned above. Back-splicing is important for CircRNAs f...

Descripción completa

Detalles Bibliográficos
Autores principales: Shen, Zhen, Shao, Yan Ling, Liu, Wei, Zhang, Qinhu, Yuan, Lin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9373444/
https://www.ncbi.nlm.nih.gov/pubmed/35962324
http://dx.doi.org/10.1186/s12864-022-08820-1
_version_ 1784767598457520128
author Shen, Zhen
Shao, Yan Ling
Liu, Wei
Zhang, Qinhu
Yuan, Lin
author_facet Shen, Zhen
Shao, Yan Ling
Liu, Wei
Zhang, Qinhu
Yuan, Lin
author_sort Shen, Zhen
collection PubMed
description BACKGROUND: Circular RNAs (CircRNAs) play critical roles in gene expression regulation and disease development. Understanding the regulation mechanism of CircRNAs formation can help reveal the role of CircRNAs in various biological processes mentioned above. Back-splicing is important for CircRNAs formation. Back-splicing sites prediction helps uncover the mysteries of CircRNAs formation. Several methods were proposed for back-splicing sites prediction or circRNA-realted prediction tasks. Model performance was constrained by poor feature learning and using ability. RESULTS: In this study, CircCNN was proposed to predict pre-mRNA back-splicing sites. Convolution neural network and batch normalization are the main parts of CircCNN. Experimental results on three datasets show that CircCNN outperforms other baseline models. Moreover, PPM (Position Probability Matrix) features extract by CircCNN were converted as motifs. Further analysis reveals that some of motifs found by CircCNN match known motifs involved in gene expression regulation, the distribution of motif and special short sequence is important for pre-mRNA back-splicing. CONCLUSIONS: In general, the findings in this study provide a new direction for exploring CircRNA-related gene expression regulatory mechanism and identifying potential targets for complex malignant diseases. The datasets and source code of this study are freely available at: https://github.com/szhh521/CircCNN. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-022-08820-1.
format Online
Article
Text
id pubmed-9373444
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-93734442022-08-13 Prediction of Back-splicing sites for CircRNA formation based on convolutional neural networks Shen, Zhen Shao, Yan Ling Liu, Wei Zhang, Qinhu Yuan, Lin BMC Genomics Research BACKGROUND: Circular RNAs (CircRNAs) play critical roles in gene expression regulation and disease development. Understanding the regulation mechanism of CircRNAs formation can help reveal the role of CircRNAs in various biological processes mentioned above. Back-splicing is important for CircRNAs formation. Back-splicing sites prediction helps uncover the mysteries of CircRNAs formation. Several methods were proposed for back-splicing sites prediction or circRNA-realted prediction tasks. Model performance was constrained by poor feature learning and using ability. RESULTS: In this study, CircCNN was proposed to predict pre-mRNA back-splicing sites. Convolution neural network and batch normalization are the main parts of CircCNN. Experimental results on three datasets show that CircCNN outperforms other baseline models. Moreover, PPM (Position Probability Matrix) features extract by CircCNN were converted as motifs. Further analysis reveals that some of motifs found by CircCNN match known motifs involved in gene expression regulation, the distribution of motif and special short sequence is important for pre-mRNA back-splicing. CONCLUSIONS: In general, the findings in this study provide a new direction for exploring CircRNA-related gene expression regulatory mechanism and identifying potential targets for complex malignant diseases. The datasets and source code of this study are freely available at: https://github.com/szhh521/CircCNN. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-022-08820-1. BioMed Central 2022-08-12 /pmc/articles/PMC9373444/ /pubmed/35962324 http://dx.doi.org/10.1186/s12864-022-08820-1 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Shen, Zhen
Shao, Yan Ling
Liu, Wei
Zhang, Qinhu
Yuan, Lin
Prediction of Back-splicing sites for CircRNA formation based on convolutional neural networks
title Prediction of Back-splicing sites for CircRNA formation based on convolutional neural networks
title_full Prediction of Back-splicing sites for CircRNA formation based on convolutional neural networks
title_fullStr Prediction of Back-splicing sites for CircRNA formation based on convolutional neural networks
title_full_unstemmed Prediction of Back-splicing sites for CircRNA formation based on convolutional neural networks
title_short Prediction of Back-splicing sites for CircRNA formation based on convolutional neural networks
title_sort prediction of back-splicing sites for circrna formation based on convolutional neural networks
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9373444/
https://www.ncbi.nlm.nih.gov/pubmed/35962324
http://dx.doi.org/10.1186/s12864-022-08820-1
work_keys_str_mv AT shenzhen predictionofbacksplicingsitesforcircrnaformationbasedonconvolutionalneuralnetworks
AT shaoyanling predictionofbacksplicingsitesforcircrnaformationbasedonconvolutionalneuralnetworks
AT liuwei predictionofbacksplicingsitesforcircrnaformationbasedonconvolutionalneuralnetworks
AT zhangqinhu predictionofbacksplicingsitesforcircrnaformationbasedonconvolutionalneuralnetworks
AT yuanlin predictionofbacksplicingsitesforcircrnaformationbasedonconvolutionalneuralnetworks