Cargando…

Multi-resBind: a residual network-based multi-label classifier for in vivo RNA binding prediction and preference visualization

BACKGROUND: Protein-RNA interactions play key roles in many processes regulating gene expression. To understand the underlying binding preference, ultraviolet cross-linking and immunoprecipitation (CLIP)-based methods have been used to identify the binding sites for hundreds of RNA-binding proteins...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Shitao, Hamada, Michiaki
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8594109/
https://www.ncbi.nlm.nih.gov/pubmed/34781902
http://dx.doi.org/10.1186/s12859-021-04430-y
_version_ 1784599899808989184
author Zhao, Shitao
Hamada, Michiaki
author_facet Zhao, Shitao
Hamada, Michiaki
author_sort Zhao, Shitao
collection PubMed
description BACKGROUND: Protein-RNA interactions play key roles in many processes regulating gene expression. To understand the underlying binding preference, ultraviolet cross-linking and immunoprecipitation (CLIP)-based methods have been used to identify the binding sites for hundreds of RNA-binding proteins (RBPs) in vivo. Using these large-scale experimental data to infer RNA binding preference and predict missing binding sites has become a great challenge. Some existing deep-learning models have demonstrated high prediction accuracy for individual RBPs. However, it remains difficult to avoid significant bias due to the experimental protocol. The DeepRiPe method was recently developed to solve this problem via introducing multi-task or multi-label learning into this field. However, this method has not reached an ideal level of prediction power due to the weak neural network architecture. RESULTS: Compared to the DeepRiPe approach, our Multi-resBind method demonstrated substantial improvements using the same large-scale PAR-CLIP dataset with respect to an increase in the area under the receiver operating characteristic curve and average precision. We conducted extensive experiments to evaluate the impact of various types of input data on the final prediction accuracy. The same approach was used to evaluate the effect of loss functions. Finally, a modified integrated gradient was employed to generate attribution maps. The patterns disentangled from relative contributions according to context offer biological insights into the underlying mechanism of protein-RNA interactions. CONCLUSIONS: Here, we propose Multi-resBind as a new multi-label deep-learning approach to infer protein-RNA binding preferences and predict novel interactions. The results clearly demonstrate that Multi-resBind is a promising tool to predict unknown binding sites in vivo and gain biology insights into why the neural network makes a given prediction. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04430-y.
format Online
Article
Text
id pubmed-8594109
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-85941092021-11-16 Multi-resBind: a residual network-based multi-label classifier for in vivo RNA binding prediction and preference visualization Zhao, Shitao Hamada, Michiaki BMC Bioinformatics Research BACKGROUND: Protein-RNA interactions play key roles in many processes regulating gene expression. To understand the underlying binding preference, ultraviolet cross-linking and immunoprecipitation (CLIP)-based methods have been used to identify the binding sites for hundreds of RNA-binding proteins (RBPs) in vivo. Using these large-scale experimental data to infer RNA binding preference and predict missing binding sites has become a great challenge. Some existing deep-learning models have demonstrated high prediction accuracy for individual RBPs. However, it remains difficult to avoid significant bias due to the experimental protocol. The DeepRiPe method was recently developed to solve this problem via introducing multi-task or multi-label learning into this field. However, this method has not reached an ideal level of prediction power due to the weak neural network architecture. RESULTS: Compared to the DeepRiPe approach, our Multi-resBind method demonstrated substantial improvements using the same large-scale PAR-CLIP dataset with respect to an increase in the area under the receiver operating characteristic curve and average precision. We conducted extensive experiments to evaluate the impact of various types of input data on the final prediction accuracy. The same approach was used to evaluate the effect of loss functions. Finally, a modified integrated gradient was employed to generate attribution maps. The patterns disentangled from relative contributions according to context offer biological insights into the underlying mechanism of protein-RNA interactions. CONCLUSIONS: Here, we propose Multi-resBind as a new multi-label deep-learning approach to infer protein-RNA binding preferences and predict novel interactions. The results clearly demonstrate that Multi-resBind is a promising tool to predict unknown binding sites in vivo and gain biology insights into why the neural network makes a given prediction. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04430-y. BioMed Central 2021-11-15 /pmc/articles/PMC8594109/ /pubmed/34781902 http://dx.doi.org/10.1186/s12859-021-04430-y Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Zhao, Shitao
Hamada, Michiaki
Multi-resBind: a residual network-based multi-label classifier for in vivo RNA binding prediction and preference visualization
title Multi-resBind: a residual network-based multi-label classifier for in vivo RNA binding prediction and preference visualization
title_full Multi-resBind: a residual network-based multi-label classifier for in vivo RNA binding prediction and preference visualization
title_fullStr Multi-resBind: a residual network-based multi-label classifier for in vivo RNA binding prediction and preference visualization
title_full_unstemmed Multi-resBind: a residual network-based multi-label classifier for in vivo RNA binding prediction and preference visualization
title_short Multi-resBind: a residual network-based multi-label classifier for in vivo RNA binding prediction and preference visualization
title_sort multi-resbind: a residual network-based multi-label classifier for in vivo rna binding prediction and preference visualization
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8594109/
https://www.ncbi.nlm.nih.gov/pubmed/34781902
http://dx.doi.org/10.1186/s12859-021-04430-y
work_keys_str_mv AT zhaoshitao multiresbindaresidualnetworkbasedmultilabelclassifierforinvivornabindingpredictionandpreferencevisualization
AT hamadamichiaki multiresbindaresidualnetworkbasedmultilabelclassifierforinvivornabindingpredictionandpreferencevisualization